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® (54) Title: MOLECULES INTERACTING WITH CASL (MICAL) POLYNUCLEOTIDES, POLYPEPTIDES, AND METHODS 
m OF USING THE SAME 

(57) Abstract: The present invention provides MICAL and MICAL-Like polypeptides and polynucleotides. Also provided are 
methods that for identifying agents that affect axon growth and placement. Furthermore, provided herein are methods for affecting 
1^" axon growth and placement. 
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MOLECULES INTERACTING WITH CASL (MICAL) POLYNUCLEOTIDES, 
POLYPEPTIDES, AND METHODS OF USING THE SAME 

STATEMENT OF GOVERNMENT SUPPORT 

[0001] This invention was made in part with government support under Grant Nos. 
NRSA-NS1 1055 and NS15165 awarded by the National Institutes of Health. The United 
States government may have certain rights in this invention. 

BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 

[0002] This invention relates generally to polynucleotides encoding a family of 
polypeptides, and more specifically to polynucleotides encoding polypeptides having 
oxygenase activity and methods of use thereof. 

BACKGROUND INFORMATION 

[0003] A great deal of research has focused on identifying factors that underlie human 
pathological conditions and disease states. This research has focused on molecules that are 
important for normal human development to identify those factors whose function is 
compromised in human disorders. For example, during development, neurons form 
connections with one another and with other targets by extending processes called axons. 

[0004] In order to make developmental connections, axons navigate over long distances 
selecting their correct pathway, finding their appropriate target area, and establishing the 
proper connections with their target. The means by which axons accomplish this, remains 
largely unknown. It is clear however, that identifying the molecular signals that enable 
axons to form these connections is important for developing treatments for many 
neurological disorders including spinal cord injury. 
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[0005] Many of the same molecules implicated as functioning to guide the growing 
axon have been shown to function in cell adhesion, cell proliferation, cytoskeletal integrity, 
and other aspects of normal cell migration both in the nervous system and outside it. 
Characterization of these molecules will identify strategies for treatment of cell migration 
disorders including tumorigenesis. Therefore, there remains a need to identify molecules 
that function to guide growing axons. 

[0006] Following spinal cord injury in humans, axons fail to reestablish their 
connections, which results in paralysis and loss of sensation of the affected area. The 
factors that inhibit axons from reestablishing their connections are not known. It is 
interesting, however, that during development, inhibition of axon growth plays a role in 
forming the nervous system. Axons are guided to their targets by molecules that attract 
them as well as by those that inhibit (i.e., repel) them. These molecules help channel axons 
into appropriate areas, as well as prevent them from entering unwanted regions. However, 
these molecules remain largely unidentified. Therefore, there remains a need to identify 
molecules that inhibit or repel axon growth. 

SUMMARY OF THE INVENTION 

[0007] The present invention relates to a family of proteins, called MICALs, that are 
large, multidomain proteins expressed in axons, that interact with the neuronal plexin A 
receptor and are required for semaphorin la-PlexA-mediated repulsive axon guidance. In 
addition to containing several domains known to interact with cytoskeletal components, 
MICALs have a flavoprotein monooxygenase domain, the integrity of which is required for 
Sema-la-PlexA repulsive axon guidance. The presence of these domains suggest a 
previously unknown role for oxidoreductases in repulsive neuronal guidance. 

[0008] In one embodiment, the present invention provides an isolated polypeptide that 
includes a plexin interacting region and one or more of an N-terminal MICAL domain, a 
calponin homology domain, a LEVI domain, and a proline rich region, wherein the 
polypeptide has monooxygenase activity, plexin interacting activity, and/or axon guidance 
regulatory activity. The polypeptide can also include a first variable MICAL region and a 
second variable MICAL region, which can form part of the proline rich region. The 
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polypeptide can include, for example, from N-terminal to C-terminal, an N-terminal 
MICAL domain, a calponin homology doxmain, a first variable MICAL region, a LEVI 
domain, a proline rich region, and a plexin interacting region. 

[0009] In one aspect, the polypeptide is a mammalian polypeptide. For example, the 
isolated polypeptide can be human MICAL-1, human MICAL-2, or human MICAL-3. 
Accordingly, the polypeptide can include an amino acid sequence as set forth in 
SEQ ID NO:2 (human MICAL-1), SEQ ID NO:4 (human MICAL-2), or SEQ ID NO:6 
(human MICAL 3). 

[001 0] In another aspect, the polypeptide is a Drosophila MICAL polypeptide. For 
example, the polypeptide can have an amino acid sequence as set forth in SEQ ID NO: 8 
(Drosophila long variant) SEQ ID NO: 10 (Drosophila medium variant), or SEQ ID NO: 12 
(Drosophila short variant). 

[001 1 ] In another embodiment, the present invention provides a MIC AL-Like 
polypeptide. Accordingly, the isolated polypeptide includes a plexin interacting region and 
alternatively one or more of a calponin homology domain, a LEVI domain, and a proline rich 
region, and wherein the polypeptide interacts with a plexin. The polypeptide can also 
include a first variable region and a second variable region, which can form part of the 
proline rich region. For example, the polypeptide includes, from N-terminal to C-terminal, 
a calponin homology domain, a first variable MICAL region, a LIM domain, a proline rich 
region, and a plexin interacting region. 

[0012] In another aspect, the isolated polypeptide is a Drosophila MICAL-Like 
polypeptide. For example, the polypeptide can have an amino acid sequence as set forth in 
SEQ ID NO: 18 (Drosophila MICAL-Like polypeptide). 

[0013] In another embodiment, the present invention provides an isolated polynucleotide 
encoding a MICAL polypeptide or a MICAL-Like polypeptide of the present invention. 
The polynucleotide can encode a mammalian MICAL polypeptide, for example, human 
MICAL-1, human MICAL-2, human MICAL-3, human MICAL-Like 1, or human MICAL- 
Like 2. Accordingly, in one aspect, the polynucleotide is the coding sequence portion of 
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SEQ ID NO: 1 (human MICAL-1 cDNA), the coding sequence portion of SEQ ID N0 3 
(human MICAL-2 cDNA), the coding sequence portion of SEQ ID NO:5 (human MICAL 3 
cDNA), the coding sequence portion of SEQ ID NO: 13 (human MICAL-Like 1 cDNA), or 
the coding sequence portion of SEQ ID NO: 15 (human MICAL-Like 2 cDNA). 

[0014] The present invention also provides an isolated polynucleotide that selectively 
hybridizes to a polynucleotide encoding a MICAL polypeptide or a MICAL-Like 
polypeptide. 

[0015] In yet another embodiment, the present invention provides a method for 
identifying an agent that affects axonal guidance regulatory activity. The method includes 
contacting a polypeptide of the present invention that has axonal guidance regulatory 
activity, or a cell expressing the polypeptide, for example recombinant^ expressing the 
polypeptide, with a candidate agent. Next, axonal guidance regulatory activity or 
expression of the polypeptide is compared in the presence versus absence of the agent. A 
difference in activity or expression is indicative of an agent that affects axonal guidance 
regulatory activity. 

[0016] In another embodiment, the present invention provides a method for affecting 
axonal guidance regulatory activity. The method includes contacting a cell, for example, a 
neuron, that expresses a polypeptide of the invention such as a MICAL polypeptide, with an 
agent that alters MICAL activity and, thereby, affects axonal guidance regulatory activity. 
In one aspect, the method is performed in vivo and includes inhibiting axonal guidance 
regulatory activity by contacting the cell with an antioxidant that inhibits MICAL activity. 
The axonal guidance activity is a semaphorin-mediated axonal repulsion. As such, in 
another embodiment, the present invention provides a method for affecting a semaphorin- 
mediated process by contacting a cell that expresses a MICAL polypeptide of the invention 
with an effective amount of an agent that modulates MICAL activity and, thereby, affects 
axonal guidance regulatory activity. An agent is, for example, a small molecule, I 
polypeptide or fragment thereof, a peptidomimetic, or an antisense polynucleotide. 

[0017] hi another embodiment, the present invention provides a method for treating a 
neurological condition in a subject, that includes contacting in the subject, a cell of the 
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central nervous system or the peripheral nervous system, having a disrupted axonal 
connection or a cell that affects axonal growth of the central nervous system or peripheral 
nervous system cell, with an amount of an agent that modulates the activity or expression of 
a MICAL polypeptide, the amount being effective to modulate axon regulatory activity, 
monooxygenase activity, and/or plexin interacting activity. In one aspect, the neurological 
condition is a spinal cord injury. 

[0018] The present invention identifies exemplary flavonoids as agents that are used in 
methods of various embodiments of the present invention to inhibit axonal guidance 
regulatory activity. A variety of flavonoid anti-oxidants are known and are candidate 
inhibitors MICAL activity and, thereby, of axonal guidance regulatory activity such as 
semaphorin-mediated axonal repulsion. In one aspect of the invention, the flavonoids 
ECGC and EC and related gallic acid derivatives are inhibitors of semaphorin-mediated 
axonal repulsion. 

[0019] In another aspect, the present invention provides a method for inducing regrowth 
of an injured process of a neuron, that includes altering the levels of reactive oxygen species 
or other oxidation products in the milieu of the neuron. 

BRIEF DESCRIP TION OF THE DRAWINGS 

[0020] Figure 1 provides a molecular characterization of Drosophila MICAL, MICAL 
expression in Drosophila embryonic motor axons, and co-immunoprecipitation of MICAL 
with neuronal PlexA. 

[0021] Figure 1 A provides a schematic diagram of the MICAL locus. Variable exons 
are indicated with asterisks and produce: 1) a "long" isoform (4723 aa); 2) a "medium" 
isoform (3002 aa) (spliced out exon is shown with "X" though lines); and 3) a "short" 
isoform (2734 aa) (spliced out exons shown by thick angled exon connector lines). The 
regions corresponding to clones 23 and 15 1 are shown. 

[0022] Figure IB provides the domain organization of the Drosophila MICAL gene. 
MICAL is characterized by flavin adenine dinucleotide (FAD) consensus binding motifs 
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(GXGXXG, DG, and GD motifs), a calponin homology domain, a LIM domain, a Proline 
nch region, and a coiled-coil motif. 



are a 



[0023] Figure 2 provides schematic representations illustrating that the MICALs 
family of neuronally-expressed plexin-interacting proteins conserved from flies to 
mammals. 

[0024] Figure 2A is a schematic representation of the organization of the MICAL family 
of proteins. Amino acid identities are indicated among vertebrate MICALs and Drosophila 
MICAL (%s within domains) and between vertebrate MICALs (%s in arrows). The black 
regions indicate sequence that is not well-conserved among family members and variable in 
length (//). Regions encoded by an ORF situated in close proximity in the genome (~10kb) 
but for which cDNA sequence connecting them has not yet been identified are indicated 
(dots). 

[0025] Figure 2B provides a schematic representation of the domain organization of the 
MICAL-Like proteins. MICAL-like proteins have a similar domain organization as the 
MICALs but lack the N terminal -500 amino acid domain. Domain alignment and amino 
acid identity between Drosophila MICAL and MICAL-like proteins is indicated (within 
domains) and between MICAL-like proteins (within arrows). Available D-MICAL-L 
cDNA and genomic DNA sequence information suggests that the D-MICAL-L protein 
begins just N-terminal to the CH domain. Human MICAL-L1 and MICAL-L2 are similar 
in overall domain organization to D-MICAL-L and do not contain the highly conserved 
-500 amino acid MICAL N-terminal domain (dots indicate where molecular analysis is 
required to conclusively define the structural features of mammalian MICAL-L proteins; P, 
proline rich region; cc, coiled-coil). 

[0026] Figure 3 illustrates that the MICALs contain Apoprotein monooxygenase 
domains required for MICAL function in Drosophila. 

[0027] Figure 3A provides a schematic representation of three sequence motifs that 
define MICALs as flavoprotein monooxygenases. An alignment of MICALs with members 
of the flavoprotein monooxygenase family is shown in which (+) indicates that MICALs 
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match the consensus, (*) indicates that MICALs match the highly important conserved 
resumes, and (.) indicates the conserved spacing of these residues within these motifs 
Shadmg of sequences is based on ClustalX: conserved hydrophobic residues, cysteine 
rescues, acids, and bases are shaded in dark gray; conserved proline and glycine residues 
are mdicated with light gray shading. MICALs contain a 10 0 o/„ ma tch with the consensus 
ADP bmding region of FAD binding proteins (FAD Fingerprint 1), a well-conserved GD 
sequence (FAD Fingerprint 2), and a well-conserved DG motif: distinguishing features of 
flavoprotem monooxygenases. The proline ((*)) in the FAD fingerprint 2 is also likely to 
be conserved. In the upper consensus line, uppercase indicates an amino acid- h is a 
hydrophobic residue, s is a small residue (i.e., compact, zero, or few side chains); c is a 
charged residue, and x is any residue. 

[0028] Figure 3B is a spectral analysis of MICAL that illustrates that MICAL is an FAD 
binding protein. A bacterial fusion protein consisting of the Drosophila MICAL 
flavoprotein monooxygenase (FM) domain has an absorption peak at 452 nm and a shoulder 
at -358 nm (dashed line), consistent with that of 50uM free FAD (solid line) and similar in 
shape to spectra from other flavoproteins. 

[0029] Figure 4 illustrates that flavoprotein monooxygenase inhibitors attenuate 
vertebrate semaphorin axonal repulsion. Inhibitors of flavoprotein monooxygenases 
(EGCG and EC) as well as specific inhibitors of other oxidation/reduction enzymes 
including nitric oxide synthase (L-NAME), xanthine oxidase (allopurinol; Alio), and 
mitochondrial electron transport (NADH dehydrogenase; rotenone; Rote) were tested for 
their ability to inhibit semaphorin-dependent repulsive axon guidance in vertebrates. 

[0030] Figure 4A provides a schematic diagram of the rat D E14/15 Rat DRG explants 
were co-cultured with 293 cells expressing Sema 3A and grown for 48 hours, in the 
presence of an inhibitor or vehicle. Axonal outgrowth was determined by measuring 
proximal (P) and distal (D) axon lengths. 

[0031] Figure 4B provides a graph illustrating that redox inhibitors do not have adverse 
effects on expression of Sema 3A or its biological activity. Media was collected from 
untransfected 293 cells (No SemaSA) or cells transfected with AP-Sema3 A and grown in 
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the presence of vehicle (Sema3A), 25 m EGCG (3A/EGCG), 500 urn EC (3A/EC) 
500 um L-NAME (3A/L-NAME), 500 um Alio (3A/Allo), or 0.1 um Rote ( 3A/Rote) and 
ligand concentration (AP activity) was determined. The media was then diluted to InM (to 
remove the active concentration of the inhibitor) and its biological activity was assayed in a 
growth cone collapse assay (% Collapse; n > 60 growth cones per condition). The AP 
activity and percentage of growth cones collapsed were similar in the presence of all 
compounds. 

[0032] Figure 4C provides a graph that quantitates the effects of oxidation/reduction 
enzyme inhibitors on Sema 3A repulsion scored as the ratio of the axon lengths on the 
proximal and distal sides of the explant (P/D ratio), and on Sema3A-mediated growth cone 
collapse indicated as % collapsed growth cones (gray). In repulsion assays, outgrowth of 
DRG axons on the side distal to the 293 cells appeared normal. Attenuation of Sema 3A- 
mediated axonal repulsion was observed with the flavoprotein monooxygenase inhibitors 
(EGCG, and EC) in a dose-dependent manner but not with specific inhibitors of other 
oxidation/reduction enzymes. n's= number of DRG explants (repulsion assays) or number 
of growth cones scored (collapse assays; distributed over 4 different explants/condition). 
For Rote, n=4, however only 4 out of 12 explants survived. (**=p<0.0001; *=p<.001; 
paired t-test). Scale bar = 550 mm. 

[0033] Figure 5 shows results of a yeast interaction assay and identification of 
Drosophila MICAL. Figure 5 A provides a diagram of the Plexin A polypeptide. Figure 5B 
illustrates that clones 23 and 151 of the yeast interaction assay encode a novel PlexA 
interacting protein. Figure 5C provides a Northern blot of Drosophila total RNA using a 
probe that included a portion of clone 151. 

[0034] Figure 6 provides a series of graphic representations that illustrate the generation 
and characterization of MICAL Loss-of-Function mutants. Figure 6A Schematic of the 
screen to remove the MICAL locus by generating a small deletion between two P elements 
that flank MICAL. Figure 6B provides a table of summarizing genetic complementation 
analyses of lines exhibiting the stretch wing phenotype. Figure 6C provides a diagram that 
summarizes complementation analyses and genetic organization of the MICAL locus. Sizes 
are in kilobases (kb); non-continuous sequence is indicated by "//". Figure 6D provide 
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Western blot that illustrates that Df(3R)swp2^ is a MICAL null allele that produces no 
MICAL protein. Prominent bands are observed at 530kD, 330kD, 300kD, 200kDa, and 
125kDa in wild type and at stronger intensity in MICAL duplication embryo; none of these 
bands are observed in Df(3R )swp ^ embryos. Arrows indicate bands predicted from 
MICAL cDNA analysis (see text; Figure 1A). 

[0035] Figure 7 identifies various domains of the Drosophila MICAL medium variant 
polypeptide (SEQ ID NO: 10). Flavoprotein Monooxygenase domain is indicated by a 
squiggly underline; Calponin Homology domain is indicated by gray highlighting; MICAL 
Homology Region of Unknown Function is indicated by italics; LIM Domain is indicated 
by black highlighting; Proline Rich (Putative SH3 Ligands) are indicated by single 
underlining; Putative IQ (calcium) Binding Domain is indicated by dashed underlining; 
Putative Ena (Ena-like Proteins) binding Domain (Renfranz and Beckerle, Curr. Opin. Cell. 
Biol. 14:88, (2002)) (proline rich region) is indicated by underlining and italics; Plexin 
Interacting Region is indicated by bold; PDZ Ligand is indicated by double underline. 

[0036] Figure 8 provides a diagram that indicates the various domains of MICAL 
polypeptides and their amino acid residue numbers. Note: The MICAL 2 and 3 plexin 
interacting region is numbered backwards due to the lack of the intervening sequence 
denoted" ". 

[0037] Figure 9 provides a diagram that indicates the various domains of MICAL-Like 
polypeptides and their amino acid residue numbers. 

[0038] Figure 10 provides identifies various domains of the mouse MICAL-1 
polypeptide (SEQ ID NO:21). Flavoprotein Monooxygenase domain is indicated by a 
squiggly underline; Calponin Homology domain is indicated by gray highlighting; MICAL 
Homology Region of Unknown Function is indicated by italics; LIM Domain is indicated 
by black highlighting; Proline Rich (Putative SH3 Ligands) are indicated by single 
underlining; Putative IQ (calcium) Binding Domain is indicated by dashed underlining; 
Putative Ena (Ena-like Proteins) (Renfranz and Beckerle, Curr. Opin. Cell. Biol. 14:88,' 
(2002)) binding Domain (proline rich region) is indicated by underlining and italics; Plexin 
Interacting Region is indicated by bold; PDZ Ligand is indicated by double underline. 
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[0039] Figure 1 1 provides the cDNA (SEQ ID NO: 1) and encoded polypeptide 
(SEQ ID NO:2) sequence of human MICAL-1. 

[0040] Figure 12 provides the cDNA (SEQ ID NO:3) and encoded polypeptide 
(SEQ ID NO:4) sequence of human MICAL-2. 

[0041] Figure 13 provides the cDNA (SEQ ID NO:5) and encoded polypeptide 
(SEQ ID NO:6) sequence of human MICAL-3. 

[0042] Figure 14 provides the cDNA (SEQ ID NO:7) and encoded polypeptide 
(SEQ ID NO:8) sequence of the Drosophila MICAL long isoform. 

[0043] Figure 1 5 provides the cDNA (SEQ ID NO:9) and encoded polypeptide 
(SEQ ID NO: 10) sequence of the Drosophila MICAL medium isoform. 

[0044] Figure 16 provides the cDNA (SEQ ID NO: 1 1) and encoded polypeptide (SEQ 
ID NO: 12) sequence of the Drosophila MICAL short isoform. 

[0045] Figure 17 provides the cDNA (SEQ ID NO: 13) and encoded polypeptide (SEQ 
ID NO: 14) sequence of human MICAL-Like 1. 

[0046] Figure 1 8 provides the cDNA (SEQ ID NO: 1 5) and encoded polypeptide (SEQ 
ID NO: 16) sequence of human MICAL-Like 2. 

[0047] Figure 1 9 provides the cDNA (SEQ ID NO: 17) and encoded polypeptide (SEQ 
ID NO: 18) sequence of Drosophila MICAL-Like. 

[0048] Figure 20 provides MICALs in other species. The amino acid sequence through 
the Flavoprotein Monooxygenase Domain is shown for the species indicated. The numbers 
indicate the amino acid number for which it aligns with Drosophila MICA medium 
isoformXe.g., 53 aligns to Drosophila MICAL amino acid 53). Percent amino acid identity 
to the corresponding region of Drosophila MICAL is shown. 
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DETAILED IWsri?T P TI o N OF THE jkve^^ 

[0049] The present invention is based on the identification of a family of flavoprotein 
monooxygenases. This family of proteins is involved in the regulation of repulsive axon 
guidance. While not wanting to be limited by a particular theory, as illustrated herein, this 
protein family appears to regulate repulsive axon guidance by directly associating with 
plexins. Through this association, "MICALS" appear to be required for semaphorin- 
mediated repulsive axon guidance. Furthermore, MICAL proteins contain multiple domains 
that are known to be important for interactions with actin, intermediate filaments, and 
cytoskeletal-associated adaptor proteins. Therefore, MICALs are excellent candidates for 
directly mediating the cytoskeletal alterations characteristic of semaphorin signaling and 
provide novel targets for the attenuation of axonal repulsion. 

[0050] In one embodiment, the present invention provides an isolated polypeptide that 
includes one or more of an N-terminal MICAL domain, a calponin homology domain, a 
LIM domain, a proline rich region, and a plexin interacting region, wherein the polypeptide 
has monooxygenase activity, plexin interacting activity, and/or axon guidance regulatory 
activity. The polypeptide can also include a first variable MICAL region and a second 
variable MICAL region, typically surrounding the LIM domain. The second variable region 
in certain aspects forms a portion of the proline rich region and can include the LIM 
domain. Accordingly, in one aspect, the polypeptide is a MICAL polypeptide. A MICAL 
polypeptide includes the following domain organization from N-terminal to C-terminal: an 
N-terminal MICAL domain, a calponin homology domain, a first variable MICAL region, a 
LIM domain, a proline rich region, and a plexin interacting region. Furthermore, a MICAL 
polypeptide has monooxygenase activity and interacts with a plexin, typically plexin A. 

[0051] The polypeptide can be a mammalian MICAL polypeptide. For example, the 
isolated polypeptide can be human MICAL-1, human MICAL-2, or human MICAL-3. 
Accordingly, the polypeptide can include an amino acid sequence as set forth in SEQ ID 
NO:2 (human MICAL-1), SEQ ID NO:4 (human MICAL-2), or SEQ ID NO:6 (human 
MICAL 3). 
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[0052] The isolated polypeptide can be a Drosophila MICAL polypeptide. For example, 
the polypeptide can have an amino acid sequence as set forth in SEQ ID NO:8 {Drosophila 
long variant), SEQ ID NO: 10 {Drosophila long variant), or SEQ ID NO: 12 {Drosophila 
long variant). 

[0053] MICALs are also referred to as 151 proteins or Zephyrins. The arrangement of 
domains within a typical MICAL polypeptide are shown in FIG. 1 A. As indicated above, a 
MICAL polypeptide of the present invention typically includes the following domain 
organization from N-terminal to C-terminal: an N-terminal MICAL domain, a calponin 
homology domain, a first variable MICAL region, a LIM domain, a proline rich region, and 
a plexin interacting region. Furthermore, a MICAL polypeptide has monooxygenase 
activity and interacts with a plexin, typically plexin A. The MICALs appear unique with 
respect to containing both calponin homology (CH) and LIM domains, in addition to their 
conserved N- and C-terminal regions (Fig. 2A). 

[0054] In certain aspects, the present invention provides a polypeptide that includes a 
calponin homology domain. In fact, a MICAL polypeptide of the present invention includes 
a calponin homology domain. A calponin homology domain is a domain that has at least 
30% amino acid sequence identity to the calponin homology domain of SEQ ID NO:2, 
residues 508 to 612, SEQ ID NO:4, residues 516 to 622, SEQ ID NO:6, residues 518 to 624, 
SEQ ID NO:8, residues 562 to 669, SEQ ID NO: 10, residues 562 to 669, and/or SEQ ID 
NO: 12, residues 562 to 669. In certain aspects, the polypeptide through the calponin 
homology domain can interact with actin. The calponin homology domain in certain 
aspects has at least 40%, 50%, 70%, 75%, 80%, 90%, 95%, or 99% sequence identity to 
SEQ ID NO:2, residues 508 to 612, SEQ ID NO:4, residues 516 to 622, SEQ ID NO:6, 
residues 518 to 624, SEQ ID NO:8, residues 562 to 669, SEQ ID NO:10, residues 562 to 
669, and/or SEQ ID NO: 12, residues 562 to 669. 

[0055] A polypeptide of the present invention in certain aspects includes a LIM domain. 
In fact, A MICAL polypeptide of the present invention includes a LIM domain. A LIM 
domain (Bach (2000), supra) is a domain that has at least 30% amino acid sequence identity 
to a LIM domain of SEQ ID NO:2, residues 697 to 750, SEQ ID NO:4, residues 1002 to 
1056, SEQ ID NO:6, residues 792 to 851, SEQ ID NO:8, residues 1074 to 1129, SEQ ID 
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NO:10, residues 1074 to 1129, and/or SEQ ID NO:12, residues 806 to 861. The LIM 
domain in certain aspects has at least 40%, 50%, 70%, 75%, 80%, 90%, 95%, or 99% 
sequence identity to the LIM domain of SEQ ID NO:2, residues 697 to 750, SEQ ID NO:4, 
residues 1002 to 1056, SEQ ID NO:6, residues 792 to 851, SEQ ID NO:8, residues 1074 to 
1129, SEQ ID NO: 10, residues 1074 to 1129, and/or SEQ ID NO:12, residues 806 to 861. 
LIM domains mediate protein/protein interactions with other LIM domain-containing 
proteins (See Bach (2000), supra). 

[0056] MICAL polypeptide of the present invention includes a proline rich region. The 
proline rich region includes the proline rich region indicated in figure IB as well as variable 
region 2 indicated in Figure IB. Accordingly, the proline rich region extends between the 
LIM domain and the Plexin-interacting region, and includes a variable proline rich domain 
and a conserved proline rich domain. Thus, the proline rich region extends from the first 
residue of the N terminal of the Plexin interacting domain to the last C terminal residue of 
the LIM domain. The proline rich region is defined by the PXXP motifs (the SH3 binding 
domains).A proline rich region is a region that has at least 1 PXXP SH3 binding domain. In 
certain aspects, the proline rich region has at least 5, 6, 7, 8, 9, 10, 12, 1 1, 12, 13, 14, 15, 16, 
17, 18, 19, or 20 SH3 binding domains. The proline rich region in certain aspects of the 
invention has at least 1%, 2%, 3%, 4%, 5%, 6%, or 7% proline residues. The proline rich 
region in certain aspects has at least 40%, 50%, 70%, 75%, 80%, 90%, 95%, or 99% 
sequence identity to the proline rich region of SEQ ID NO:2, residues 751-909, SEQ ID 
NO:4, residues 1057 to a residue that is 230 amino acids from the C terminus, SEQ ID 
NO:6, residues 852 to a residue that is 190 amino acids from the C terminus, SEQ ID NO:8, 
residues 1130-4522, SEQ ID NO:10, residues 1130-2801, and/or SEQ ID NO:12, residues 
862-2533. The proline rich region includes numerous potential SH3 binding domains (See 
e.g., Wages et al., J. Virol. 66(4): 1866-74 (1992)) and can include at least one Ena binding 
domain. For example, there are 18 putative SH3 binding domains in the medium isoform 
(within the variable region 2) of Drosophila MICAL. 

[0057] A MICAL polypeptide of the present invention includes a plexin interacting 
region at its C-terminus. This domain is typically immediately C-terminal to the proline 
rich region. Typically, the plexin interacting region contains a predicted heptad-repeat, 
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coiled-coil structure (Figure IB), a motif thought to be involved in protein-protein 
interactions (Burkhard et al., 2001). Interestingly, this region of a MIC AL of the present 
invention typically shares amino acid similarity with several other coiled-coil domain- 
containing proteins including a portion of the alpha domain found in the Ezrin, Radixin, and 
Moesin (ERM) proteins (-22% identity; Bretscher et al., 2000). 

[0058] A plexin interacting region is a region that has at least 30% amino acid sequence 
identity to the plexin interacting region of SEQ ID NO:2, residues 910 to 1067, SEQ ID 
NO:4, residues 348 to 509 after the missing intervening sequence (labeled ". . .")> SEQ ID 
NO:6, residues 800 to 989 after the missing intervening sequence (labeled . ."), SEQ ID 
NO:8, residues 4522 to 4723, SEQ ID NO: 10, residues 2802 to 3002, and/or SEQ ID 
NO: 12, residues 2534 to 2734. The proline rich region in certain aspects has at least 40%, 
50%, 70%, 75%, 80%, 90%, 95%, or 99% sequence identity to the proline rich region of 
SEQ ID NO:2, residues 910 to 1067, SEQ ID NO:4, residues 348 to 509 after the missing 
intervening sequence (labeled ". . ."), SEQ ID NO:6, residues 800 to 989 after the missing 
intervening sequence (labeled ". . ."), SEQ ID NO:8, residues 4522 to 4723, SEQ ID NO:10, 
residues 2802 to 3002, and/or SEQ ID NO: 12, residues 2534 to 2734. 

[00591 In certain aspect, the last four amino acids of MICAL (ESII) are a PDZ protein 
binding motif (Harris and Lim, 2001). 

[0060] Typically, MICAL polypeptides of the present invention have two regions of 
varying length (See e.g., Figure IB), a first variable MICAL region and a second variable 
MICAL region, that have no significant similarity to any other proteins, and that appear to 
determine the size of the different MICAL proteins (Figure IB). The second variable region 
includes a high concentration of proline residues, and as indicated above, forms a portion of 
the proline rich region. For example, the second variable region of Drosophila MICAL 
medium isoform has 124 proline residues out of 1663 (i.e., 7.5% proline). The variable 
region and the proline rich region in figure IB, for the Drosophila medium isoform has 130 
prolines out of 1671 residues (i.e., 7.8% proline). 

[0061] Interposed between the first and the second variable regions, MICALs typically 
have a LIM domain as discussed above (Figure IB), a protein-protein interaction module 
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found in a variety of proteins involved in signal transduction cascades and in cytoskeletal 
organization (Bach, 2000), and also a calponin homology (CH) domain as discussed above 
(Figure IB), a domain also found in cytoskeletal and signal transduction proteins and 
known to be involved in actin filament binding (Gimona et al., 2002). 

[0062] The present invention also provides an isolated polypeptide as disclosed above, 
wherein the polypeptide includes an N-terminal MICAL domain having at least about 40%, 
45%, 50%, 55%, 60%, 65%. 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% 
sequence identity to the N-terminal MICAL domains of SEQ ID NOS:2, 4, 6, 8, 10, or 12, 
and has monooxygenase activity. 

[0063] The MICAL N-terminal domain of -500 amino acids is highly conserved among 
MICAL-related proteins, but is unique over its entire length in comparison to other proteins. 
This N-terminal region is referred to herein as the N-terminal MICAL domain. In certain 
aspects, the N-terminal MICAL domain includes the portion of a MICAL polypeptide that 
is N-terminal to the Calponin Homology domain. The N-terminal domain can be, for 
example about 500 to about 561 amino acids in length. N-terminal MICAL domains 
include residues 1-484 in SEQ ID NO:2, residues 1-492 in SEQ ID NO:4, residues 1-492 in 
SEQ ID NO:6, and residues 44-529 of SEQ ID Nos:8, 10, and 12. 

[0064] The N-terminal MICAL domain typically includes a consensus dinucleotide 
binding sequence, GxGxxG (Figures IB and 2A) which is distinct from the sequence 
present in classical mononucleotide binding motifs (Eggink et al., 1990; Eppink et al., 1997; 
Schulz, 1992; Wierenga et al., 1986). The N-terminal MICAL domain also typically 
includes three separate sequence motifs spaced throughout this domain that define them as 
flavoprotein monooxygenases (also called hydroxylases), a subclass of oxidoreductases 
(Eggink et al., 1990; Eppink et al., 1997; Wierenga et al., 1986). The amino acid sequence 
surrounding the GXGXXG motif typically match the consensus sequence for the ADP 
binding region of flavin adenine dinucleotide (FAD) binding proteins (Rossmann fold or 
FAD Fingerprint 1, Figures IB and 2 A), and distinguishes this region from consensus NAD, 
or NADP binding folds (Vallon, 2000; Wierenga et al., 1986). The N-terminal MICAL 
domain also typically has a well-conserved GD motif (FAD Fingerprint 2; Figures IB and 
2A) C-terminal to the FAD Fingerprint 1 region, which is important for binding the ribose 
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moiety of FAD (Eggink et aL, 1990; Eppink et al., 1997). Finally, N-terminal MICAL 
domains typically have the conserved DG motif ("Conserved Motif; Figures IB and 2A) 
between the FAD Fingerprint 1 and 2 motifs that has been reported to be involved in 
binding the pyrophosphate moiety of FAD (Eppink et al., 1997). 

[0065] Proteins with the consensus FAD binding regions of MICALs bind FAD and use 
FAD in the catalysis of oxidation-reduction reactions. Flavoprotein monooxygenases are 
oxidoreductases (enzymes that catalyze oxidation and reduction reactions) and catalyze the 
insertion of one atom of molecular oxygen into their substrate using nucleotides as electron 
donors (Massey, 1995). Accordingly, MICAL polypeptides of the present invention 
typically have flavoprotein monooxygenase activity. Like other monooxygenases of this 
type, polypeptides of the present invention typically use FAD as a co-enzyme. Apart from 
the three consensus regions reported above, monooxygenases typically vary significantly, 
reflecting the wide range of enzymes in this family and their variable substrate binding 
pockets also encompassed within this domain (Eppink et al., 1997). However, MICALs and 
other monooxygenases show significant similarity within these three FAD binding regions 
and also similar spacing of these regions within the monooxygenase domain. Therefore, 
polypeptides of the present invention typically have a high degree of identity with the three 
FAD binding regions (e.g., greater than 50%, 75%, 80%, 85%, 90%, 95%. 98% and 99% 
sequence identity) and similar spacing of these regions. 

[0066] As indicated above, polypeptides of the present invention have monooxygenase 
activity and bind FAD through their N-terminal MICAL domain. Methods are known in the 
art for determining whether a polypeptide binds FAD. For example, a solution of a 
polypeptide of the present invention, including a solution of a polypeptide that includes the 
N-terminal MICAL domain but not the other domains typically present on MICALs, such as 
MICAL-FM (see the Examples section herein), is yellow in color, a characteristic of 
flavoproteins. 

[0067] Accordingly, spectral analysis of a polypeptide of the present invention can be 
used to further identify the polypeptide as flavin binding polypeptides, and therefore to 
further determine whether a polypeptide includes an N-terminal MICAL domain. For 
example, a polypeptide that includes an N-terminal MICAL domain, especially a 
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polypeptide that includes none of the other MIC AL domains, typically has distinctive 
spectral properties. For example, a polypeptide that includes the N-terminal MICAL 
domain but none of the other MICAL domains can show an absorption peak at around 450 
nm and a shoulder at around 360 nm (Figure 3B). This is similar to the absorption spectra 
of FAD itself (~450nm and ~360nm; Macheroux, 1999), and to other related flavoproteins 
(e.g., p-Hydroxybenzoate Hydroxylase, Hosokawa and Stanier, 1966; and GidA, White et 
al., 2001). Monooxygenase activity and other flavoprotein activities can also be determined 
using other standard protocols as listed in Flavoprotein Protocols, Methods in Molecular 
Biology, Vol. 131, S.K. Chapman and G.A. Reid (Eds), Humana Press, 1999, which is 
incorporated by reference herein in its entirety. 

[0068] A MICAL polypeptide of the present invention can include additional functional 
domains observed on MICAL polypeptides (See Fig. 7). These additional domains include 
for example, a MICAL homology region of unknown function (Fig. 7), a putative IQ 
(calcium) binding domain and a PDZ ligand. Accordingly, in certain aspects, a MICAL 
polypeptide of the present invention includes, optionally from N-terminal to C-terminal, a 
flavoprotein monooxygenase domain (N-terminal MICAL domain), a calponin homology 
domain, a MICAL homology region of unknown function; a LIM Domain; a proline rich 
region (Putative SH3 Ligands), a putative IQ (calcium) binding domain; a putative Ena 
binding domain (Ena-like Proteins) (Renfranz and Beckerle, Curr. Opin. Cell Biol 14:88, 
(2002)), and a plexin interacting region. 

[0069] A MICAL polypeptide of the present invention can include a vimentin interacting 
region. 

[0070] As discussed in more detail herein, the disclosed sequences provided in this 
specification can be used to identify MICALs or MICAL-Like polypeptides of other 
species. For example, the sequences disclosed herein were used to search public EST and 
genomic databases to identify the sequence of mouse MICAL- 1 (SEQ ID NO:21) (FIG. 10) 
(see below). Accordingly, in another aspect, the present invention provides a mouse 
MICAL polypeptide, such as a polypeptide according to SEQ ID NO:21, SEQ ID NO:22, or 
SEQ ID NO:23. 
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[0071] Using the Drosophila MICAL amino acid sequence (or MICALs from human or 
any other species) the publicly available EST and genomic databases can be searched to 
identify MICAL proteins in other species. The MICAL N terminal flavoprotein 
monooxygenase domain identified herein, is extremely highly conserved between MICALs 
in all species (>50% identity). Furthermore, the N terminal flavoprotein monooxygenase 
domain (i.e., the N-terminal MICAL domain) is not very similar («<20% identity) to 
anything else in the data base (in the genome) over the MICAL -500 amino acid 
flavoprotein monooxygenase stretch. Therefore, using this domain to search is extremely 
fast and accurate to find MICAL sequences. The searching strategy can be a standard, 
routinely used search strategy. The degree of similarity between MICALs in all species is 
so high that by searching a public database with the Drosophila MICAL flavoprotein 
monooxygenase amino acid sequence using the publicly available Blast searching 
program's TBLASTN command one can pull out the coding exons from ordered or 
unordered (rough draft) genomic sequence. TBLASTN searches a nucleotide database for 
DNA that is translated by the program into amino acid residues. The alignments can be 
observed on a web browser. The aligned sequence can then be assembled to identify the 
coding sequence for the MICAL(s) for any species of interest. An EST nucleotide database 
can also be searched in a similar manner to a genomic database and as quickly. The 
sequences provided herein, including the sequences of Figure 20 and the mouse MICAL 
sequences (SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:23), provide examples of 
sequences that were obtained in this manner. This strategy would effectively and 
unequivocally identify all of the raw sequences in the database that code for a MICAL 
protein. 

[0072] Furthermore, the rest of the MICAL protein is also highly conserved. Therefore, 
the other portions of the Drosophila MICAL amino acid sequence and the amino acid 
sequence from those portions of the flavoprotein monooxygenase domain previously 
identified for a particular species, can then be used to assemble publicly available sequences 
to identify the full MICAL protein. This can efficiently be done by searching similar 
databases and using a similar strategy and piecing together the aligned sequence to get a full 
MICAL sequence. 
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[0073] In certain aspects, a polypeptide of the present invention is at least 100, 200, 300, 
400, 500, 1000, 1500, 2000, 2500, or 3000 amino acids in length. 

[0074] In another aspect, a polypeptide of the present invention is a functional portion of 
a MICAL polypeptide. A functional portion of a MICAL polypeptide is a polypeptide that 
includes at least an N-terminal MICAL domain that retains monooxygenase activity and/or 
a functional MICAL plexin interacting region. 

[0075] In another embodiment, the present invention provides an isolated polypeptide 
that includes an N-terminal MICAL domain, but not one or more of the other domains and 
regions typically found on a MICAL polypeptide. In one aspect, the polypeptide has an 
N-terminal MICAL domain but no other domain or region of a MICAL polypeptide. The 
N-terminal MICAL domain for example, is at least 40%, 50%, 75%, 80%, 90%, 95%, 98%, 
99%, or 100% identical to an N-terminal MICAL domain of a naturally-occurring MICAL, 
such as Drosophila MICAL or human MICAL 1, 2, or 3. The polypeptide typically retains 
monooxygenase activity and typically includes the consensus dinucleotide binding sequence 
and three motifs found in flavoprotein monooxygenases disclosed above. 

[0076] The MICAL polypeptides of the present invention have axon guidance regulatory 
activity. Axon guidance regulatory activity is the ability to affect the positioning, steering, 
and/or outgrowth of an axon in vivo or in vitro. Not to be limited by theory, it is believed 
that MICAL polypeptides of the present invention regulate axon guidance by associating 
with plexins, thereby being involved in semaphorin-plexin mediated repulsive axon 
guidance, especially Semaphorin la (Sema-la)-PlexA-mediated repulsive axon guidance, as 
discussed in more detail hereinbelow. 

[0077] The examples section herein illustrates several methods that can be used to 
identify axon guidance regulatory activity, referred to herein as axon guidance regulatory 
activity assays. For example, where the polypeptide being analyzed, or an ortholog thereof, 
is encoded for by a Drosophila gene, Drosophila mutants can be generated that are loss of 
function or gain of function mutants. For example, by deleting all or part of the gene 
encoding the polypeptide, a loss of function mutant can be generated. If the polypeptide has 
axon guidance regulatory activity, Drosophila loss of function mutants devoid of the 
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function of the polypeptide should exhibit motor axon guidance defects similar to the 
distinct and highly penetrant defects seen in Semala and PlexA loss of function mutants and 
seen in the MICAL loss of function mutants discussed in the Examples section. 

[0078] As another example, axon guidance regulatory activity can be identified by 
employing an in vitro rat DRG growth cone repulsion assay (Messersmith et al., 1995). The 
method involves co-culturing E14/15 rat DRG explants with 293 or COS cells expressing 
Sema3A in the presence of an inhibitor of an on-test polypeptide, as illustrated in the 
Examples section. NGF-dependent DRG axons exhibit little to no outgrowth toward 
Sema3A-secreting 293 cell aggregates. If the on-test polypeptide has axon guidance 
activity then inhibitors of the activity of the polypeptide will inhibit axon repulsion (See 
e.g., Fig. 4). Axon guidance regulatory activity can also be determined, for example, using 
single cell turning assays as described by Poo et al. {Neuron, 19, 1225-35 (1997)), growth 
cone collapse assays as described by Raper et al. (See e.g., Luo et al, Cell 75, 217-27 
(1993)), and mouse knock-out genetic approaches where phenotypes can be observed that 
must originate from loss of a repulsive response (based on expression data of the ligand, 
etc. . .) (See e.g., Giger et al., Neuron, 25, 29 (2000)). 

[0079] It will be recognized that strategies used herein to identify MICALs and MICAL- 
like proteins can be used to identify additional MICAL polypeptides, such as MICAL 
polypeptides of other mammalian species. For example, as illustrated in the Examples, a 
yeast 2 hybrid system that uses the terminal highly conserved "C2" portion of the PlexA 
cytoplasm domain can be used to screen cDNA libraries prepared from any organism. 
Additionally, MICAL polypeptides can be identified by the ability to rescue mutant 
organisms, such as mutant Drosophila prepared using methods disclosed herein, which lack 
MICAL function. Finally, recombinant DNA technologies can be used to identify and/or 
develop polynucleotides that encode MICAL or MICAL-Like polypeptides, that are related 
to, but distinct from those disclosed herein as discussed in more detail hereinbelow. 

[0080] Accordingly, the present invention provides an isolated polypeptide as disclosed 
above, wherein the isolated polypeptide is at least, for example, 75%, 80%, 85%, 90%, 
95%, 97%, 98%, 99%, or 99.9% identical to SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO: 10, or SEQ ID NO: 12. 
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[0081] A polypeptide of the present invention can be a MICAL variant, ortholog, 
isoform, or mutant. Provided herein are sequences of certain human and Drosophila 
MICALs. However, it will be recognized that additional variants are likely to exist in a 
population, which have different sequences than the disclosed MICALs. These alleles will 
likely be highly related in sequence to the disclosed MICALs, and can be further identified 
as a MICAL by their axonal guidance regulation activity, and by the position in the genome 
of the polynucleotide sequence encoding the allele. Therefore, based on the presently 
disclosed MICAL sequences, orthologous sequences in other species besides human or 
Drosophila, such as rat or mouse, can be identified using methods well known in the art, as 
illustrated herein. Methods for identifying MICAL polynucleotides of other species are 
discussed in detail below. 

[0082] Furthermore, additional MICAL polypeptides and polynucleotides encoding the 
MICAL polypeptides, can be identified using protein alignment tools, such as those 
reported in the Examples included herein. As will be recognized, these tools include, for 
example, PFAM, BLAST, PRINTS, JALVIEW, AND ClustalX, some of which are 
discussed in more detail herein. 

[0083] Finally, as disclosed herein, MICAL transcripts in at least some species are 
alternately spliced so as to give rise to different polypeptide isoforms from the same 
MICAL gene. These polypeptide variants or isoforms are examples of MICAL 
polypeptides of the present invention. MICAL genes cover greater than 40kb of genomic 
sequences and have at least 25 exons (See Fig. 1 A and Fig. 5). Based on analysis of 
isolated cDNAs and Western analysis (See Fig. 6D), there are at least three Drosophila 
MICAL isoforms, "long" (See e.g. SEQ ID NO:8, "medium" (SEQ ID NO: 10), and "short" 
(SEQ ID NO: 12) (Fig. 1 A and Figs. 13-15). 

[0084] In another embodiment, the present invention provides a MICAL consensus 
polypeptide. A MICAL consensus polypeptide is a polypeptide that includes at least 50%, 
75%, 80%, 85%, 90%, 95%, 99%, or 100% of the most prevalent amino acid occurrences of 
all MICALs, or of MICALs of a species, such as human MICALs. It will be recognized 
that amino acid sequences of polypeptides of a protein family such as MICALs, can be 
aligned and an amino acid sequence of a polypeptide with axonal guidance regulatory 



WO 03/066821 



PCT/US03/03551 



22 

activity can be identified which is different than all of the naturally-occurring family 
members, but which includes at least some of the most common amino acid occurrences of 
the naturally-occurring family members. 

[0085] Experimental results presented in the Examples section herein demonstrate that at 
least some MICALs directly associate with plexins and are required for semaphorin- 
mediated repulsive axon guidance. Furthermore, as disclosed above, MICALs contain 
multiple domains that are known to be important for interactions with actin, intermediate 
filaments, and cytoskeletal-associated adaptor proteins. Therefore MICALs are excellent 
candidates for directly mediating the cytoskeletal alterations characteristic of semaphorin 
signaling and provide novel targets for the attenuation of axonal repulsion. 

[0086] During neural development axons reach their appropriate targets by interpreting a 
myriad of guidance cues present in their environment. Semaphorin proteins, one of the 
largest families of guidance cues, are known to influence axon pathfinding, fasciculation, 
branching, and neuronal cell migration (He et al., 2002; Raper, 2000). A chemorepulsive 
role in axon guidance has been extensively demonstrated both in vitro and in vivo for many 
semaphorins, but they also mediate attractive neuronal guidance. 

[0087] The 7 classes of semaphorins include both transmembrane and secreted proteins 
and are evolutionarily conserved, structurally and in many cases functionally, from 
invertebrates to vertebrates (Semaphorin Nomenclature Committee, 1999). For example, 
the transmembrane semaphorin Sema-la in Drosophila is present on developing motor 
axons and acts as a repellent to regulate motor axon fasciculation in vivo (Yu et al., 1998). 
The related vertebrate transmembrane semaphorin Sema6A also functions as a repellent for 
axons of sympathetic neurons in vitro (Xu et al., 2000). Sema 3A, a well-characterized 
vertebrate secreted semaphorin, is a potent axonal repellent for a variety of neurons in vitro, 
and in vivo serves as a chemorepellent essential for the establishment of many axonal 
pathways (Raper, 2000). Similarly, the related Drosophila secreted semaphorin Sema-2a is 
expressed on developing muscles and regulates motor axon pathfinding as a target-derived 
chemorepellent (Matthes et al., 1995; Winberg et al., 1998a). 
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[0088] Insight into how semaphorins signal repulsive guidance comes from work 
showing that plexins, a large family of evolutionarily conserved transmembrane proteins, 
serve as signal transducing receptors for both membrane-bound and secreted semaphorins 
(Tamagnone and Comoglio, 2000). The four classes of plexins have been found to 
associate directly with members of five different semaphorin classes. In the Drosophila 
nervous system plexin A (PlexA) is a functional receptor in vivo for Semala-mediated 
motor axon repulsion (Winberg et al., 1998b). In vertebrates, repulsion mediated by class 3 
secreted semaphorins is dependent on plexin function both in vitro and in vivo (Cheng et al., 
2001; reviewed in Tamagnone and Comoglio, 2000). However, repulsive guidance 
mediated by class 3 semaphorins, including SemaSA and Sema3F, requires a holoreceptor 
complex which includes a ligand-binding obligate co-receptor, neuropilin-1 or neuropilin-2, 
and a class A plexin. Plexin cytoplasmic domains are highly conserved and, for certain A 
class plexins, are responsible for signaling semaphorin-mediated repulsive axon guidance 
(Cheng et al., 2001; Takahashi and Strittmatter, 2001). 

[0089] The repulsive nature of semaphorin signaling mediated by plexin receptors is due 
to the modification of the growth cone cytoskeleton. For example, following exposure to 
secreted Sema3 A, growth cones undergo rapid collapse which is accompanied by the 
depolymerization of F-actin and decreased ability to polymerize new F-actin (Fan et al., 
1993). Several modulators of cytoskeletal dynamics have been implicated in this process 
including Rho family GTPases, p21 -activated kinase (PAK), and LIM kinase (Liu and 
Strittmatter, 2001; Whitford and Ghosh, 2001). In addition, members of the collapsin 
response mediator protein (CRMP) family, the Ig superfamily protein LI , intracellular 
levels of cGMP, and the catalytically inactive receptor tyrosine kinase family member 
offtrack (OTK), have also been implicated in transducing semaphorin repulsive guidance 
(He et al., 2002). It remains unknown, however, how plexins directly regulate the activity 
of these signaling molecules in order to modulate cytoskeletal dynamics.. 

[0090] In another embodiment, the present invention provides an isolated polypeptide 
that includes a plexin interacting region. The plexin interacting region is typically at least 
40%, 50%, 75%, 80%, 90%, 95%, 98%, 99%, or 100% identical to a plexin interacting 
region of a naturally-occurring MICAL, such as Drosophila MICAL or human MICAL 1, 2, 
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or 3, and retains the ability to interact with a plexin. The plexin interacting region for 
example, can be at least 90% identical to a plexin interacting region of Drosophila MICAL 
(SEQ ID NO:8, SEQ ID NO:10, or SEQ ID NO:12), or to a plexin interacting region of 
human MICAL 1 (SEQ ID NO:2), human MICAL 2 (SEQ ID NO:4), human MICAL 3 
(SEQ ID NO:6), or a conservative variant thereof. A conservative variant is a polypeptide 
that is identical to another polypeptide except for conservative amino acid substitutions, as 
discussed hereinbelow. 

[0091] A polypeptide of this embodiment of the invention typically retains the ability to 
specifically interact with all or part of a plexin either directly or indirectly. For example, 
the isolated polypeptide can directly interact with the C2 domains of a PlexA, such as 
PlexA3 and PlexA4. As disclosed in the Examples herein, human MICAL- 1 and mouse 
MICAL-2 specifically interact with the C2 domains of human PlexA3 and mouse PlexA4, 
respectively. Indirect interactions can be identified, for example, using genetic approaches 
illustrated in the Examples herein. 

[0092] Methods for determining whether an on-test polypeptide is capable of interacting 
with a plexin are well-known in the art. For example, traditional methods of identifying 
specific protein interactions can be used. Accordingly, immunoprecipitation can be used to 
identify whether a polypeptide interacts with a plexin by determining whether the on-test 
polypeptide and a plexin coimmunoprecipitate, as illustrated in the Examples section. 
Furthermore, for example, a plexin protein can be isolated on a protein gel, and binding of a 
labeled on-test polypeptide can be determined. Alternatively, for example, a yeast 
interaction assay can be used, as disclosed in the Examples herein (See Fig. 5). 

[0093] The isolated polypeptide that includes a plexin interacting region, in certain 
aspects, does not have one or more other domains and/or activities typically present in a 
MICAL polypeptide. For example, the polypeptide in certain aspects does not have 
monooxygenase activity. The polypeptide of this embodiment of the invention can be a 
mutant MICAL polypeptide that acts as a dominant negative mutant with respect to MICAL 
activity. For example, as illustrated in the Examples section, the polypeptide can be a 
truncated MICAL polypeptide that includes at least one, but not all, functional domains 
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typically present on a MICAL polypeptide. Alternatively, the mutant can include mutations 
that alter, for example by destroying, certain MICAL functions. 

[0094] The examples section provided herein provides a MICAL G ^ W mutant (SEQ ID 
NO:20) that is mutated in the three glycine residues within the FAD fingerprint 1 motif of 
MICAL to tryptophan, a mutation known in other proteins to disrupt FAD binding without 
altering the overall structure of the protein (Kubo et al., 1997; Lawton and Philpot, 1993; 
Wierenga et al., 1986). As illustrated in the Examples section, the MICAL G ^ W mutant 
which includes an intact plexin-interacting domain but is functionally inactive, exerts a 
dominant-negative effect on motor axon guidance in a wild-type genetic background. Not 
to be limited by theory, it is believed that MICAL G_>W exerts its dominant negative effect by 
binding to a plexin, thereby competing for binding of wild-type MICAL to the plexin target. 

[0095] As illustrated in the Examples herein, a polypeptide according to this 
embodiment of the invention can be a truncated mutant that only includes a plexin- 
interacting region. Accordingly, the polypeptide can have an amino acid sequence as set 
forth in SEQ ID NO: 19. The polypeptide can be targeted to the membrane by including a 
membrane targeting sequence such as an N-terminal myristoylation sequence as illustrated 
in the Examples section herein (see mutant MICAL Myr->CT ). Other membrane targeting 
sequences such as, for example, a palmitylation sequence, can be used. 

[0096] A polypeptide of this embodiment of the invention, can include, for example, a 
MICAL or a MICAL-Like plexin interacting domain, since both of these protein families 
include a plexin interacting domain. 

[0097] In another embodiment the present invention relates to a family of MIC AL-like 
(MICAL-L) proteins, members of which have a similar organization to MICALs but lack 
the region N-terminal to the CH domain (Figure 2B). MICAL-L proteins include at least 
one MICAL-L protein in Drosophila (D-MICAL-L) and at least two family members in 
humans. D-MICAL-L cDNA and genomic DNA sequence information suggest that D- 
MICAL-L Plexin interacting domain begins just N-terminal to the CH domain of a MICAL 
protein. Analysis of publicly available mammalian cDNA and genomic sequences suggests 
that human MICAL-L1 and MICAL-L2 are similar in overall domain organization to D- 
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MICAL-L and do not contain the highly conserved -500 amino acid MICAL N-terminal 
domain. 

[0098] Accordingly, the present invention provides an isolated polypeptide that includes 
a plexin interacting region and alternatively one or more of a calponin homology domain, a 
LIM domain, and a proline rich region. The polypeptide can also include a first variable ' 
region and a second variable region. Accordingly, in one aspect, the polypeptide is or 
includes a MICAL-like polypeptide. A MICAL-Like polypeptide includes, from N- 
terminal to C-terminal, a calponin homology domain, a first variable MICAL region, a LIM 
domain, a second variable MICAL region, a proline rich region, and a plexin interacting 
region. 

[0099] A polypeptide according to this aspect of the invention typically specifically 
interacts with a plexin, as discussed above for MICAL polypeptides of the present 
invention. 

[0100] The polypeptide, for example, includes a calponin homology domain, followed 
by a first variable region, followed by a LIM domain, followed by a second variable region, 
followed by a proline rich region, and followed by a plexin interacting region. Such 
polypeptides include a variant, ortholog, isoform, or mutant of a MICAL-Like protein 
disclosed herein. 

[01 01] In one aspect, the polypeptide is a mammalian MICAL-Like polypeptide. For 
example, the isolated polypeptide can be human MICAL-Like 1 or human MICAL-Like 2. 
Accordingly, the isolated polypeptide can have an amino acid sequence that is at least 40%, 
50%, 75%, 80%, 90%, 95o/o, 98%, 99%, or 100% identical to a an amino acid sequence as ' 
set forth in SEQ ID NO:14 (human MICAL-Like 1) or SEQ ID NO:16 (human 
MICAL-like 2). 

[0102] In another aspect, the isolated polypeptide is a Drosophila MICAL-L 
polypeptide. For example, the polypeptide can have an amino acid sequence as set forth in 
SEQ ID NO:18 {Drosophila MICAL-Like), or a variant, ortholog, isoform, or mutant 
thereof. 



WO 03/066821 



PCT/US03/03551 



27 

[0103] In another aspect, the polypeptide of the present invention is a functional portion 
of a MICAL-Like polypeptide. A functional portion of a MICAL polypeptide is a 
polypeptide that includes at least a functional domain, for example a functional MICAL 
plexin interacting region. 

[01 04] A functional peptide portion of a MICAL or MICAL-Like polypeptide for 
example, can be obtained by examining peptide portions of a MICAL or MICAL-Like 
polypeptide using methods as provided herein or other standard methods, to identify 
fragments that retain at least one of the activities of a wild-type MICAL including the 
ability to interact with a plexin, particularly Plexin A, and monooxygenase activity. 

[0105] A functional peptide portion of a MICAL or MICAL-Like polypeptide that 
specifically interacts with plexin can be identified using any of various assays known to be 
useful for identifying specific protein-protein interactions. Such assays include, for 
example, methods of gel electrophoresis, affinity chromatography, the two hybrid system of 
Fields and Song {Nature 340:245-246, 1989; see, also, U.S. Patent No. 5,283,173; Fearon et 
al., Proc. Natl. Acad. Set, USA 89:7958-7962, 1992; Chien et al., Proc. Natl. Acad. Sci. 
USA 88:9578-9582, 1991; Young, Biol. Reprod. 58:302-311(1998), each of which is 
incorporated herein by reference), the reverse two hybrid assay (Leanna and Hannink, Nucl. 
Acids Res. 24:3341-3347, 1996, which is incorporated herein by reference), the repressed 
transactivator system (U.S. Patent No. 5,885,779, which is incorporated herein by 
reference), the phage display system (Lowman, Ann. Rev. Biophys. Biomol. Struct. 26:401- 
424, 1997, which is incorporated herein by reference), GST/HIS pull down assays, mutant 
operators (WO 98/01 879, which is incorporated herein by reference), the protein 
recruitment system (U.S. Patent No. 5,776,689, which is incorporated herein by reference), 
and the like (see, for example, Mathis, Clin. Chem. 41:139-147, 1995 Lam, Anticancer 
Drug Res. 12:145-167, 1997; Phizicky et al., Microbiol. Rev. 59:94-123, 1995; each of 
which is incorporated herein by reference). 

[01 06] A functional peptide portion of a MICAL or MICAL-Like polypeptide also can 
be identified using methods of molecular, modeling. For example, an amino acid sequence 
of a MICAL or MICAL-Like polypeptide can be entered into a computer system having 
appropriate modeling software, and a three dimensional representation of the MICAL or 
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MICAL-Like polypeptide ("virtual MICAL" or "virtual MICAL-Like polypeptide ") can be 
produced. A MICAL or MICAL-Like polypeptide amino acid sequence also can be entered 
into the computer system, such that the modeling software can simulate portions of the 
MICAL or MICAL-Like polypeptide sequence, and can identify those peptide portions that 
can interact specifically, for example, with the virtual plexin. 

[0107] It should be recognized that such methods, including two hybrid assays and 
molecular modeling methods, also can be used to identify other specifically interacting 
molecules encompassed within the present invention. For example, the methods can be 
used to identify other proteins to which MICALS and/or MICAL-Like proteins bind, as 
revealed by the various domains of these proteins. 

[0108] Modeling systems useful for the purposes disclosed herein can be based on 
structural information obtained, for example, by crystallographic analysis or nuclear 
magnetic resonance analysis, or on primary sequence information (see, for example, 
Dunbrack et ah, "Meeting review: the Second meeting on the Critical Assessment of 
Techniques for Protein Structure Prediction (CASP2) (Asilomar, California, December 13- 
16, 1996). Fold Des. 2(2): R27-42, (1997); Fischer and Eisenberg, Protein Sci. 5:947-55, 
1996; (see, also, U.S. Patent No. 5,436,850); Havel, Prog. Biophys. Mol Biol 56:43-78, 
1991; Lichtarge et aL, J. Mol Biol 274:325-37, 1997; Matsumoto et al., J. Biol Ghent. 
270:19524-31, 1995; Sali et aL, J. Biol Ghent. 268:9023-34, 1993; Sali, Molec. Med. Today 
1:270-7, 1995a; Sali, Curr. Opin. Biotechnol. 6:437-51, 1995b; Sali et aL, Proteins 23: 318- 
26, 1995c; Sali, Nature Struct. Biol. 5:1029-1032, 1998; U.S. Patent No. 5,933,819; U.S. 
Patent No. 5,265,030, each of which is incorporated herein by reference). 

[0109] The crystal structure coordinates of a MICAL or MICAL-Like polypeptide can be 
used to design compounds that bind to the protein and alter its physical or physiological 
properties in a variety of ways. The structure coordinates of the protein can also be used to 
computationally screen small molecule databases for agents that bind to the polypeptide to 
develop modulating or binding agents, which can act as agonists or antagonists of MICAL 
axon guidance regulatory activity. Such agents can be identified by computer fitting kinetic 
data using standard equations (see, for example, Segel, "Enzyme Kinetics" (J. Wiley & Sons 
1975), which is incorporated herein by reference). 
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[0110] Methods of using crystal structure data to design inhibitors or binding agents are 
known in the art. For example, MICAL or MICAL-Like polypeptide coordinates can be 
superimposed onto other available coordinates of similar proteins, including proteins having 
a bound inhibitor, to provide an approximation of the way the inhibitor interacts with the 
receptor. Computer programs employed in the practice of rational drug design also can be 
used to identify compounds that reproduce interaction characteristics similar to those found, 
for example, between a MICAL or MICAL-Like polypeptide and a co-crystallized plexin. 
Detailed knowledge of the nature of the specific interactions allows for the modification of 
compounds to alter or improve solubility, pharmacokinetics, and the like, without affecting 
binding activity. 

[0111] Computer programs for carrying out the activities necessary to design agents 
using crystal structure information are well known. Examples of such programs include, 
Catalyst Databases™ - an information retrieval program accessing chemical databases such 
as BioByte Master File, Derwent WDI and ACD; Catalyst/HYPO™ - generates models of 
compounds and hypotheses to explain variations of activity with the structure of drug 
candidates; Ludi™ - fits molecules into the active site of a protein by identifying and 
matching complementary polar and hydrophobic groups; and Leapfrog™ - "grows" new 
ligands using a genetic algorithm with parameters under the control of the user. 

[01 12] Various general purpose machines can be used with such programs, or it may be 
more convenient to construct more specialized apparatus to perform the operations. Generally, 
the embodiment is implemented in one or more computer programs executing on 
programmable systems each comprising at least one processor, at least one data storage system 
(including volatile and non-volatile memory and/or storage elements), at least one input device, 
and at least one output device. The program is executed on the processor to perform the 
functions described herein. 

[0113] Each such program can be implemented in any desired computer language, 
including, for example, machine, assembly, high level procedural, or object oriented 
programming languages, to communicate with a computer system. In any case, the language 
may be a compiled or interpreted language. The computer program will typically be stored on 
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a storage media or device, for example, a ROM, CD-ROM, magnetic or optical media, or the 
like, that is readable by a general or special purpose programmable computer, for configuring 
and operating the computer when the storage media or device is read by the computer to 
perform the procedures described herein. The system may also be considered to be 
implemented as a computer-readable storage medium, configured with a computer program, 
where the storage medium so configured causes a computer to operate in a specific and 
predefined manner to perform the functions described herein. 

[0114] Embodiments of the invention include systems, for example, internet based systems, 
particularly computer systems which store and manipulate coordinate information obtained by ' 
crystallographic or NMR analysis, or amino acid or nucleotide sequence information, as 
disclosed herein. As used herein, the term "computer system" refers to the hardware 
components, software components, and data storage components used to analyze coordinates 
or sequences as set forth herein. The computer system typically includes a processor for 
processing, accessing and manipulating the sequence data. The processor can be any well 
known type of central processing unit, for example, a Pentium E or Pentium m processor from 
Intel Corporation, or a similar processor from Sun, Motorola, Compaq, Advanced 
MicroDevices or International Business Machines. 

[01 15] Typically the computer system is a general purpose system that comprises the 
processor and one or more internal data storage components for storing data, and one or more 
data retrieving devices for retrieving the data stored on the data storage components. A skilled 
artisan can readily appreciate that any one of the currently available computer systems are 
suitable. 

[0116] Where it is desired to identify a chemical entity that interacts specifically with 
MICAL or MICAL-Like polypeptide, any of several methods to screen chemical entities or 
fragments for their ability to interact specifically with the molecule can be used. This process 
may begin by visual inspection, for example, of MICAL or MICAL-Like polypeptide on the 
computer screen. Selected peptide portions of MICAL or MICAL-Like polypeptid 
chemical entities that can act as mimics, then can be positioned in a variety of orientations 
docked, within an individual binding site of the MICAL or MICAL-Like polypeptid 
Docking can be accomplished using software such as Quanta and Sybyl, followed by 
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minimization and molecular dynamics with standard molecular mechanics forcefields, such as 
CHARMM and AMBER. 

[01 1 7] Specialized computer programs can be particularly useful for selecting peptide 
portions of a prodomain, or chemical entities useful, for example, as a MICAL or MICAL- 
Like polypeptide agonist or antagonist. Such programs include, for example, GRID 
(Goodford, J. Med. Chem., 28:849-857, 1985; available from Oxford University, Oxford, UK); 
MCSS (Miranker and Karplus, Profez^: Structure. Function and Genetics 11:29-34, 1991, 
available from Molecular Simulations, Burlington MA); AUTODOCK (Goodsell and Olsen, 
Proteins: Structure. Function, and Genetics 8:195-202, 1990, available from Scripps Research 
Institute, La Jolla CA); DOCK (Kuntz, et al., J. Mol. Biol. 161:269-288, 1982, available from 
University of California, San Francisco CA), each of which is incorporated herein by 
reference. 

[01 18] Suitable peptides or agents that have been selected can be assembled into a single 
compound or binding agent. Assembly can be performed by visual inspection of the 
relationship of the fragments to each other on the three-dimensional image displayed on a 
computer screen, followed by manual model building using software such as Quanta or Sybyl. 
Useful programs to aid one of skill in the art in connecting the individual chemical entities or 
fragments include, for example, CAVEAT (Bartlett et al, Special Pub., Royal Chem. Soc. 
78:182-196, 1989, available from the University of California, Berkeley CA); 3D Database 
systems such as MACCS-3D (MDL Information Systems, San Leandro CA; for review, see 
Martin, J. Med. Chem. 35:2145-2154, 1992); HOOK (available from Molecular Simulations, 
Burlington, Mass.), each of which is incorporated herein by reference. 

[0119] In another embodiment, the present invention provides an isolated polynucleotide 
encoding a polypeptide of the present invention disclosed hereinabove. Accordingly, a 
polynucleotide of the present invention can encode a MICAL polypeptide or a MICAL-like 
polypeptide of the present invention. A polynucleotide of the present invention that 
encodes a MICAL polypeptide encodes an isolated polypeptide that includes one or more of 
an N-terminal MICAL domain, a calponin homology domain, a LIM domain, a proline rich 
region, and a plexin interacting region, wherein the polypeptide has monooxygenase 
activity, plexin interacting activity, and/or axon guidance regulatory activity. The encoded 
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polypeptide can also include a first variable region and a second variable region surrounding 
the L1M domain. 

[0120] A polynucleotide of the present invention that encodes a MICAL-Like 
polypeptide encodes an isolated polypeptide that includes one or more of a calponin 
homology domain, a LIM domain, a proline rich region, and a plexin interacting region, 
wherein the polypeptide has plexin interacting activity. The encoded polypeptide can also 
include a first variable region and a second variable region surrounding the LIM domain. 

[0121] In one aspect the polynucleotide encodes a mammalian MICAL polypeptide, or a 
functional portion thereof, or MICAL-like polypeptide, or a functional portion thereof. For 
example, the polynucleotide can encode all or a portion of human MICAL- 1, human 
MICAL-2, human MICAL-3, human MICAL-Like 1, or human MICAL-like 2. As such the 
polynucleotide can include all or a portion (e.g. a cDNA, or a coding region) of a human 
MICAL- 1 gene, human MICAL-2 gene, human MICAL-3 gene, human MICAL-Like 1 
gene, or human MICAL-like 2 gene. The polynucleotide, for example, can include a coding 
region or an entire transcript. Accordingly, the polynucleotide can encode a polypeptide that 
includes an amino acid sequence as set forth in SEQ ID NO:2 (human MICAL- 1), SEQ ID 
NO:4 (human MICAL-2), SEQ ID NO:6 (human MICAU3), SEQ ID NO: 14 (human 
MICALXike 1), or SEQ ID NO:16 (human MICAL-Like 2), or an isoform thereof. 

[0122] The polynucleotide can include a polynucleotide that is at least 50%, 75%, 80%, 
85%, 90%, 95%, 98%, 99%, or 100% identical to a MICAL coding sequence as set forth in 
figures 1 1 to 16. Accordingly, a polynucleotide of the present invention in certain aspects, 
includes a coding nucleotide portion of SEQ ID NO:l (human MICAL-1 cDNA), SEQ ID 
NO:3 (human MICAL-2 coding sequence), SEQ ID NO:5 (human MICAL-3 cDNA), SEQ 
ID NO: 13 (human MICALXike 1 cDNA), or SEQ ID NO: 14 (human MICAL-Like 2 
cDNA), or a portion thereof. The polynucleotide can include an entire MICAL cDNA or 
gene, or an entire MICAL-Like cDNA or gene, or a portion thereof. 

[0123] A polynucleotide according to this embodiment of the invention can encode a 
Drosophila MICAL polypeptide or MICAL-Like polypeptide, for example a polypeptide 
having the sequence as set forth in SEQ ID NO:8 {Drosophila MICAL long isoform), SEQ 
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ID NO:10 {Drosophila MICAL medium isoform), SEQ ID NO:12 {Drosophila MICAL 
short isoform), or SEQ ID NO: 18 {Drosophila MICAL-Like polypeptide). For example, 
the polynucleotide can be 50%, 75%, 80%, 85%, 90%, 95%, 98%, 99%, or 100% identical 
to a nucleotide sequence as set forth in SEQ ID NO:7 (Drosophila MICAL long isoform 
cDNA sequence), SEQ ID NO:9 (Drosophila MICAL medium isoform cDNA sequence), 
SEQ ID NO:ll (Drosophila MICAL short isoform cDNA sequence), or SEQ ID NO:17 
(Drosophila MICAL-Like cDNA sequence), or a portion thereof. 

[0124] Polynucleotides of the present invention are typically at least 15, 25, 50, 75, 100, 
125, 150, 200, 250, 500, 1000, 2500, 5000, 10000, 25000, 5000, 10000, 15000, 20000, 
25000, 30000, or 40,000 nucleotides in length. 

[0125] hi another embodiment, the present invention provides a polynucleotide that 
specifically hybridizes to a polynucleotide that encodes a MICAL polypeptide or a MICAL- 
Like polypeptide of the present invention. For example, the polynucleotide can specifically 
hybridize to a polynucleotide that encodes all or a portion of a polypeptide of 
SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, 
SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, or SEQ ID NO:18, or a complement 
thereof, that in certain aspects has monooxygenase activity. 

[0126] A polynucleotide of the present invention can inhibit expression of a 
polynucleotide that encodes a MICAL polypeptide or a MICAL-Like polypeptide of the 
present invention. The polynucleotide can include a polynucleotide that is complementary 
to a nucleotide sequence that is at least 50%, 75%, 80%, 85%, 90%, 95%, 98%, 99%, or 
100% identical to a all or a portion, such as a coding portion, of a nucleotide sequence as set 
forth in SEQ ID NO:l (human MICAL- 1 cDNA), SEQ ID NO:3 (human MICAL-2 cDNA), 
SEQ ID NO:5 (human MICAL-3 cDNA), SEQ ID NO:13 (human MICAL-like 1 cDNA), 
or SEQ ID NO: 15 (human MICAL-like 2 cDNA), or a portion thereof. 

[0127] Polynucleotides encoding MICAL or MICAL-Like polypeptides of various 
organisms in addition to those identified herein, can be identified using well known 
procedures and algorithms based on identity (or homology) to the disclosed sequences. 
Homology or identity is often measured using sequence analysis software such as the 
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Sequence Analysis Software Package of the Genetics Computer Group (University of 
Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 53705). Such 
software matches similar sequences by assigning degrees of homology to various deletions, 
substitutions and other modifications. The terms "homology" and "identity," when used 
herein in the context of two or more nucleic acids or polypeptide sequences, refer to two or 
more sequences or subsequences that are the same or have a specified percentage of amino 
acid residues or of nucleotides that are the same when compared and aligned for maximum 
correspondence over a comparison window or designated region as measured using any 
number of sequence comparison algorithms or by manual alignment and visual inspection. 

[0128] For sequence comparison, typically one sequence acts as a reference sequence, to 
which test sequences are compared. When using a sequence comparison algorithm, test and 
reference sequences are entered into a computer, subsequence coordinates are designated, if 
necessary, and sequence algorithm program parameters are designated. Default program 
parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

[0129] The term "comparison window" is used broadly herein to include reference to a 
segment of any one of the number of contiguous positions, for example, about 20 to 600 
positions, for example, amino acid or nucleotide position, usually about 50 to about 200 
positions, more usually about 100 to about 150 positions, in which a sequence may be 
compared to a reference sequence of the same number of contiguous positions after the two 
sequences are optimally aligned. Methods of alignment of sequence for comparison are 
well-known in the art. Optimal alignment of sequences for comparison can be conducted, 
for example, by the local homology algorithm of Smith and Waterman (Adv. Appl. Math. 
2:482, 1981), by the homology alignment algorithm of Needleman and Wunsch (J. Mol. 
Biol. 48:443, 1970), by the search for similarity method of Person and Lipman (Proc. Natl. 
Acad. StiL USA 85:2444, 1988), each of which is incorporated herein by reference; by 
computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA 
in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 
Madison, WI); or by manual alignment and visual inspection. Other algorithms for 
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determining homology or identity include, for example, in addition to a BLAST program 
(Basic Local Alignment Search Tool at the National Center for Biological Information), 
ALIGN, AMAS (Analysis of Multiply Aligned Sequences), AMPS (Protein Multiple 
Sequence Alignment), ASSET (Aligned Segment Statistical Evaluation Tool), BANDS, 
BESTSCOR, BIOSCAN (Biological Sequence Comparative Analysis Node), BLIMPS 
(BLocks IMProved Searcher), FASTA, Intervals & Points, BMB, CLUSTAL V, CLUSTAL 
W, CONSENSUS, LCONSENSUS, WCONSENSUS, Smith-Waterman algorithm, 
DARWIN, Las Vegas algorithm, FNAT (Forced Nucleotide Alignment Tool), Framealign, 
Framesearch, DYNAMIC, FILTER, FSAP (Fristensky Sequence Analysis Package), GAP 
(Global Alignment Program), GENAL, GIBBS, GenQuest, ISSC (Sensitive Sequence 
Comparison), LALIGN (Local Sequence Alignment), LCP (Local Content Program), 
MACAW (Multiple Alignment Construction & Analysis Workbench), MAP (Multiple 
Alignment Program), MBLKP, MBLKN, PIMA (Pattern-Induced Multi-sequence 
Alignment), SAGA (Sequence Alignment by Genetic Algorithm) and WHAT-IF. Such 
alignment programs can also be used to screen genome databases to identify polynucleotide 
sequences having substantially identical sequences. 

[0130] A number of genome databases are available for comparison through the National 
Center for Biotechnology internet site (http://www.ncbi.nlm.nih.gov/). For example, the 
NCBI site provides access to the complete genomes of human (Ventor, J.C., et al., Science 
291:1304-1351 (2001), M genitalium, M. jannaschii, H. influenzae, E. coli, yeast (S. 
cerevisiae), andZX melanogaster fAdams et al., Science 287: 2185-2195 (2000)). 

[0131] One example of a useful algorithm is BLAST and BLAST 2.0 algorithms, which 
are described by Altschul et al. {Nucleic Acids Res. 25:3389-3402, 1977; J. Mol Biol 
275:403-410, 1990, each of which is incorporated herein by reference). Software for 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (http://www.ncbi.nlm.nih.gov). This algorithm involves first 
identifying high scoring sequence pairs (HSPs) by identifying short words of length W in 
the query sequence, which either match or satisfy some positive-valued threshold score T 
when aligned with a word of the same length in a database sequence. T is referred to as the 
neighborhood word score threshold (Altschul et al., supra, 1977, 1990). These initial 
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neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, for 
nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always >0). For amino acid sequences, a scoring matrix is used to calculate the cumulative 
score. Extension of the word hits in each direction are halted when: the cumulative 
alignment score falls off by the quantity X from its maximum achieved value; the 
cumulative score goes to zero or below, due to the accumulation of one or more negative- 
scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T, and X determine the sensitivity and speed of the alignment. The 
BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1, an 
expectation (E) of 10, M=5, N=4 and a comparison of both strands. For amino acid 
sequences, the BLASTP program uses as defaults a wordlength of 3, and expectations (E) of 
10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff, Proc. Natl Acad. Set, 
USA 89:10915, 1989) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a 
comparison of both strands. 

[0132] The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, for example, Karlin and Altschul, Proc. Natl. Acad. Sci., USA 
90:5873, 1993, which is incorporated herein by reference). One measure of similarity 
provided by BLAST algorithm is the smallest sum probability (P(N)), which provides an 
indication of the probability by which a match between two nucleotide or amino acid 
sequences would occur by chance. For example, a nucleic acid is considered similar to a 
references sequence if the smallest sum probability in a comparison of the test nucleic acid 
to the reference nucleic acid is less than about 0.2, more preferably less than about 0.01, and 
most preferably less than about 0.001. 

[0133] In one embodiment, protein and nucleic acid sequence homologies are evaluated 
using the Basic Local Alignment Search Tool ("BLAST"). In particular, five specific 
BLAST programs are used to perform the following task: 

[0134] (1) BLASTP and BLASTS compare an amino acid query sequence against a 
protein sequence database; 
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[0135] (2) BLASTN compares a nucleotide query sequence against a nucleotide 
sequence database; 

[0136] (3) BLASTX compares the six- frame conceptual translation products of a query 
nucleotide sequence (both strands) against a protein sequence database; 

[0137] (4) TBLASTN compares a query protein sequence against a nucleotide sequence 
database translated in all six reading frames (both strands); and 

[0138] (5) TBLASTX compares the six-frame translations of a nucleotide query 
sequence against the six-frame translations of a nucleotide sequence database. 

[0139] The BLAST programs identify homologous sequences by identifying similar 
segments, which are referred to herein as "high-scoring segment pairs/ 1 between a query 
amino or nucleic acid sequence and a test sequence which is preferably obtained from a 
protein or nucleic acid sequence database. High-scoring segment pairs are preferably 
identified (i.e., aligned) by means of a scoring matrix, many of which are known in the art. 
Preferably, the scoring matrix used is the BLOSUM62 matrix (Gonnet et al., Science 
256:1443-1445, 1992; Henikoff and Henikoff, Proteins 17:49-61, 1993, each of which is 
incorporated herein by reference). Less preferably, the PAM or PAM250 matrices may also 
be used (Schwartz and Dayhoff, eds., "Matrices for Detecting Distance Relationships: Atlas 
of Protein Sequence and Structure" (Washington, National Biomedical Research Foundation 
1978)). BLAST programs are accessible through the U.S. National Library of Medicine, for 
example, at www.ncbi.nlm.nih. gov . 

[0140] The parameters used with the above algorithms may be adapted depending on the 
sequence length and degree of homology studied. In some embodiments, the parameters 
may be the default parameters used by the algorithms in the absence of instructions from the 
user. 

[0141] Therefore, using MICAL sequences disclosed herein, and known methods and 
databases, a polynucleotide encoding a MICAL or MICAL-Like polypeptide, or the MICAL 
or MICAL-Like polypeptide or protein can be identified from any organism. Therefore, 
MICAL polynucleotides, polypeptides, and proteins of the present invention include, for 



WO 03/066821 



PCT/US03/03551 



38 

example, mouse, rat, cow, pig, horse, dog, human, chicken, turkey, zebrafish, and other 
species. 

[0142] It should also be recognized that reference is made herein to particular peptides or 
polypeptides beginning or ending at "about" a particular amino acid residue. The term 
"about" is used in this context because it is recognized that a particular protease can cleave a 
MICAL polypeptide at or immediately adjacent to a proteolytic cleavage recognition site, or 
one or a few amino acids from the recognition site. As such, reference, for example, to a 
MICAL polypeptide having a sequence of about amino acid residues 1 to 263 of SEQ ID 
NO:2 would include an amino terminal peptide portion of MICAL that has a carboxy 
terminus ending at amino acid residue 257 to amino acid residue 269, preferably at amino 
acid residue 260 to amino acid residue 266. 

[0143] The term "peptide," "peptide portion," or polypeptide is used broadly herein to 
mean two or more amino acids linked by a peptide bond. The term "fragment" or 
"proteolytic fragment" also is used herein to refer to a product that can be produced by a 
proteolytic reaction on a polypeptide, i.e., a peptide produced upon cleavage of a peptide 
bond in the polypeptide. Although the term "proteolytic fragment" is used generally herein 
to refer to a peptide that can be produced by a proteolytic reaction, it should be recognized 
that the fragment need not necessarily be produced by a proteolytic reaction, but also can be 
produced using methods of chemical synthesis or methods of recombinant DNA technology, 
as discussed in greater detail below, to produce a synthetic peptide that is equivalent to a 
proteolytic fragment. In view of the disclosed homology of MICALs and MICAL-Like 
proteins with other proteins, it will be recognized that a polypeptide of the invention is 
characterized, in part, in that it is not present in previously disclosed members of this 
superfamily. Whether a polypeptide portion of a MICAL or MICAL-Like polypeptide is 
present in a previously disclosed protein readily can be determined using the computer 
algorithms described above. 

[0144] Generally, a peptide or polypeptide of the invention contains at least about six 
amino acids, usually contains about ten amino acids, and can contain fifteen or more amino 
acids, particularly twenty or more amino acids. It should be recognized that the terms 
"peptide" and "polypeptide" is not used herein to suggest a particular size or number of 
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amino acids comprising the molecule, and that a polypeptide of the invention can contain up 
to several amino acid residues or more. 

[0145] As used herein, the term "substantially purified" or "substantially pure" or 
"isolated" means that the molecule being referred to, for example, a polypeptide or a 
polynucleotide, is in a form that is relatively free of proteins, nucleic acids, lipids, 
carbohydrates or other materials with which it is naturally associated. Generally, a 
substantially pure polypeptide, polynucleotide, or other molecule constitutes at least twenty 
percent of a sample, generally constitutes at least about fifty percent of a sample, usually 
constitutes at least about eighty percent of a sample, and particularly constitutes about 
ninety percent or ninety-five percent or more of a sample. A determination that a peptide or 
a polynucleotide of the invention is substantially pure can be made using well known 
methods, for example, by performing electrophoresis and identifying the particular 
molecule as a relatively discrete band. A substantially pure polynucleotide, for example, 
can be obtained by cloning the polynucleotide, or by chemical or enzymatic synthesis. A 
substantially pure peptide or polypeptide can be obtained, for example, by a method of 
chemical synthesis, or using methods of protein purification, followed by proteolysis and, if 
desired, further purification by chromatographic or electrophoretic methods. 

[0146] A polypeptide of the invention can be identified by comparison to a MICAL or 
MICAL-Like sequence and determining that the amino acid sequence of the polypeptide is 
contained within the MICAL or MICAL-Like polypeptide sequence, respectively. It should 
be recognized, however, that a polypeptide of the invention need not be identical to a 
corresponding amino acid sequence of MICAL or a MICAL-Like polypeptide. Thus, a 
polypeptide of the invention can correspond to an amino acid sequence of a MICAL 
polypeptide, for example, but can vary from a naturally occurring sequence, for example, by 
containing one or more D-amino acids in place of a corresponding L-amino acid; or by 
containing one or more amino acid analogs, for example, an amino acid that has been 
derivatized or otherwise modified at its reactive side chain. Similarly, one or more peptide 
bonds in the polypeptide can be modified. In addition, a reactive group at the amino 
terminus or the carboxy terminus or both can be modified. Such polypeptides can be 
modified, for example, to have improved stability to a protease, an oxidizing agent or other 
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reactive material the peptide may encounter in a biological environment, and, therefore, can 
be particularly useful in performing a method of the invention. Of course, the polypeptides 
can be modified to have decreased stability in a biological environment such that the period 
of time the polypeptide is active in the environment is reduced. 

[0147] The sequence of a MICAL or MICAL-Like polypeptide of the invention also can 
be modified by incorporating a conservative amino acid substitution for one or a few amino 
acids in the polypeptide. Conservative amino acid substitutions include the replacement of 
one amino acid residue with another amino acid residue having relatively the same chemical 
characteristics, for example, the substitution of one hydrophobic residue such as isoleucine, 
valine, leucine or methionine for another, or the substitution of one polar residue for 
another, for example, substitution of arginine for lysine; or of glutamic for aspartic acid; or 
of glutamine for asparagine; or the like. Examples of positions of a MICAL polypeptide 
that can be modified are evident from examination of differences in the disclosed MICAL 
sequences. 

[0148] The present invention also provides a substantially purified proteolytic fragment 
of a MICAL polypeptide or a functional peptide portion thereof. Apeptide portion of a 
MICAL polypeptide that is equivalent to a proteolytic fragment of a MICAL can be 
produced by a chemical method or a recombinant DNA method. 

[0149] The term "polynucleotide" is used broadly herein to mean a sequence of two or 
more deoxyribonucleotides or ribonucleotides that are linked together by a phosphodiester 
bond. As such, the term "polynucleotide" includes RNA and DNA, which can be a gene or 
a portion thereof, a cDNA, a synthetic polydeoxyribonucleic acid sequence, or the like, and 
can be single stranded or double stranded, as well as a DNA/RNA hybrid. Furthermore, the 
term "polynucleotide" as used herein includes naturally occurring nucleic acid molecules, 
which can be isolated from a cell, as well as synthetic molecules, which can be prepared, for 
example, by methods of chemical synthesis or by enzymatic methods such as by the 
polymerase chain reaction (PCR). In various embodiments, a polynucleotide of the 
invention can contain nucleoside or nucleotide analogs, or a backbone bond other than a 
phosphodiester bond (see above). 
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[0150] In general, the nucleotides that make up a polynucleotide are naturally occurring 
deoxyribonucleotides, such as adenine, cytosine, guanine or thymine linked to 
2-deoxyribose, or ribonucleotides such as adenine, cytosine, guanine or uracil linked to 
ribose. However, a polynucleotide also can contain nucleotide analogs, including 
non-naturally occurring synthetic nucleotides or modified naturally occurring nucleotides. 
Such nucleotide analogs are well known in the art and commercially available, as are 
polynucleotides containing such nucleotide analogs (Lin et al., Nucl Acids Res. 22:5220- 
5234 (1994); Jellinek et al., Biochemistry 34:11363-11372 (1995); Pagratis et al., Nature 
Biotechnol 15:68-73 (1997), each of which is incorporated herein by reference). 

[0151] The covalent bond linking the nucleotides of a polynucleotide generally is a 
phosphodiester bond. However, the covalent bond also can be any of numerous other 
bonds, including a thiodiester bond, a phosphorothioate bond, a peptide-like bond or any 
other bond known to those in the art as useful for linking nucleotides to produce synthetic 
polynucleotides (see, for example, Tarn et al., Nucl Acids Res. 22:977-986 (1994); Ecker 
and Crooke, BioTechnology 13:351360 (1995), each of which is incorporated herein by 
reference). The incorporation of non-naturally occurring nucleotide analogs or bonds 
linking the nucleotides or analogs can be particularly useful where the polynucleotide is to 
be exposed to an environment that can contain a nucleolytic activity, including, for example, 
a tissue culture medium or upon administration to a living subject, since the modified 
polynucleotides can be less susceptible to degradation. 

[0152] A polynucleotide comprising naturally occurring nucleotides and phosphodiester 
bonds can be chemically synthesized or can be produced using recombinant DNA methods, 
using an appropriate polynucleotide as a template. Li comparison, a polynucleotide 
comprising nucleotide analogs or covalent bonds other than phosphodiester bonds generally 
will be chemically synthesized, although an enzyme such as T7 polymerase can incorporate 
certain types of nucleotide analogs into a polynucleotide and, therefore, can be used to 
produce such a polynucleotide recombinantly from an appropriate template (Jellinek et al, 
supra,, 1995). 

[0153] Where a polynucleotide encodes a polypeptide, for example, a polypeptide 
portion of a MIC AL or a polypeptide agent, the coding sequence generally is contained in a 
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vector and is operatively linked to appropriate regulatory elements, including, if desired, a 
tissue specific promoter or enhancer. The encoded peptide can be further operatively 
linked, for example, to a peptide tag such as a His-6 tag or the like, which can facilitate 
identification of expression of the agent in the target cell. A polyhistidine tag peptide such as 
His-6 can be detected using a divalent cation such as nickel ion, cobalt ion, or the like. 
Additional peptide tags include, for example, a FLAG epitope, which can be detected using an 
anti-FLAG antibody (see, for example, Hopp et al., BioTechnology 6:1204 (1988); U.S. 
Patent No. 5,01 1,912, each of which is incorporated herein by reference); a c-myc epitope, 
which can be detected using an antibody specific for the epitope; biotin, which can be 
detected using streptavidin or avidin; and glutathione S-transferase, which can be detected 
using glutathione. Such tags can provide the additional advantage that they can facilitate 
isolation of the operatively linked peptide or peptide agent, for example, where it is desired 
to obtain a substantially purified peptide corresponding to a proteolytic fragment of a 
MICAL or MICAL-Like polypeptide. 

[0154] As used herein, the term "operatively linked" or "operatively associated" means 
that two or more molecules are positioned with respect to each other such that they act as a 
single unit and effect a function attributable to one or both molecules or a combination 
thereof. For example, a polynucleotide sequence encoding a polypeptide of the invention 
can be operatively linked to a regulatory element, in which case the regulatory element 
confers its regulatory effect on the polynucleotide similarly to the way in which the 
regulatory element would effect a polynucleotide sequence with which it normally is 
associated with in a cell. A first polynucleotide coding sequence also can be operatively 
linked to a second (or more) coding sequence such that a chimeric polypeptide can be 
expressed from the operatively linked coding sequences. The chimeric polypeptide can be a 
fusion polypeptide, in which the two (or more) encoded peptides are translated into a single 
polypeptide, i.e., are covalently bound through a peptide bond; or can be translated as two 
discrete peptides that, upon translation, can operatively associate with each other to form a 
stable complex. 

[0155] A polynucleotide of the invention, including a polynucleotide agent useful in 
performing a method of the invention, can be contacted directly with a target cell. For 
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example, oligonucleotides useful as axitisense molecules, ribozymes, or triplexing agents 
can be directly contacted with a target cell, whereupon they enter the cell and affect their 
function. A polynucleotide agent also can interact specifically with a polypeptide, for 
example, a MICAL polypeptide, thereby altering the ability of the MICAL to interact 
specifically with a plexin. Such polynucleotides, as well as methods of making and 
identifying such polynucleotides, are disclosed herein or otherwise well known in the art 
(see, for example, O'Connell et al., Proc. Natl Acad. Set, USA 93:5883-5887, 1996; Tuerk 
and Gold, Science 249:505-510, 1990; Gold et al., Ann. Rev. Biochem. 64:763-797, 1995; 
each of which is incorporated herein by reference). 

[0156] A polynucleotide of the invention, which can encode a MICAL or MICAL-Like 
polypeptide or can encode a mutant MICAL or MICAL-Like polypeptide or functional 
peptide portion thereof, or can be a polynucleotide agent useful in performing a method of 
the invention, can be contained in a vector, which can facilitate manipulation of the 
polynucleotide, including introduction of the polynucleotide into a target cell. The vector 
can be a cloning vector, which is useful for maintaining the polynucleotide, or can be an 
expression vector, which contains, in addition to the polynucleotide, regulatory elements 
useful for expressing the polynucleotide and, where the polynucleotide encodes a peptide, 
for expressing the encoded peptide in a particular cell. An expression vector can contain the 
expression elements necessary to achieve, for example, sustained transcription of the 
encoding polynucleotide, or the regulatory elements can be operatively linked to the 
polynucleotide prior to its being cloned into the vector. 

[0157] An expression vector, or the polynucleotide included on the vector generally 
contains or encodes a promoter sequence, which can provide constitutive or, if desired, 
inducible or tissue specific or developmental stage specific expression of the encoding 
polynucleotide, a poly-A recognition sequence, and a ribosome recognition site or internal 
ribosome entry site, or other regulatory elements such as an enhancer, which can be tissue 
specific. The vector also can contain elements required for replication in a prokaryotic or 
eukaryotic host system or both, as desired. Such vectors, which include plasmid vectors and 
viral vectors such as bacteriophage, baculovirus, retrovirus, lentivirus, adenovirus, vaccinia 
virus, semliki forest virus and adeno-associated virus vectors, are well known and can be 
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purchased from a commercial source (Promega, Madison WI; Stratagene, La Jolla CA; 
GIBCO/BRL, Gaithersburg MD) or can be constructed by one skilled in the art (see, for 
example, Meth. Enzymol., Vol. 185, Goeddel, ed. (Academic Press, Inc., 1990); Jolly, Cane. 
Gene Then 1:51-64, 1994; Flotte, J. Bioenerg. Biomemb. 25:37-42, 1993; Kirshenbaum et 
aL, J. Clin. Invest 92:381-387, 1993; each of which is incorporated herein by reference). 

[0158] A tetracycline (tet) inducible promoter can be particularly useful for driving 
expression of a polynucleotide of the invention, for example, a polynucleotide encoding a 
dominant negative form of a MICAL polypeptide. Upon administration of tetracycline, or a 
tetracycline analog, to a subject containing a polynucleotide operatively linked to a tet 
inducible promoter, expression of the encoded polypeptide is induced, whereby the 
polypeptide can effect its activity, for example, whereby a polypeptide agent can reduce or 
inhibit semaphorin mediated axonal repulsion. Such a method can be used, for example, to 
induce axon formation after spinal cord injury. 

[0159] The polynucleotide also can be operatively linked to tissue specific regulatory 
element, for example, a neuron specific regulatory element, such that expression of an 
encoded peptide is restricted to the neurons in an individual, or to neurons in a mixed 
population of cells in culture. Neuron specific regulatory elements are well known in the art 
as illustrated in the Examples section (See also, e.g., Nelson, S.B., et al., Mol Endocrinol 
14:1509-22 (2000); and Navarro et al., Gene Ther. 6:1884-92 (1999)). For example, after a 
spinal cord injury, a vector that encodes a dominant negative MICAL polypeptide 
operatively linked to a neuron-specific promoter, can be delivered to the site of spinal cord 
injury. Expression of the dominant negative mutant in neurons is expected to inhibit 
MICAL regulatory activity, thereby inhibiting semaphorin-mediated axon repulsion. This 
inhibition permits axons to regrow and migrate to reach new targets 

[0160] Viral expression vectors can be particularly useful for introducing a 
polynucleotide into a cell, particularly a cell in a subject. Viral vectors provide the 
advantage that they can infect host cells with relatively high efficiency and can infect 
specific cell types. For example, a polynucleotide encoding a MICAL polypeptide, or 
functional peptide portion thereof can be cloned into a baculovirus vector, which then can 
be used to infect an insect host cell, thereby providing a means to produce large amounts of 
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the encoded MICAL or MICAL-Like protein. The viral vector also can be derived from a 
virus that infects cells of an organism of interest, for example, vertebrate host cells such as 
mammalian, avian or piscine host cells. Viral vectors can be particularly useful for 
introducing a polynucleotide useful in performing a method of the invention into a target 
cell. Viral vectors have been developed for use in particular host systems, particularly 
mammalian systems and include, for example, retroviral vectors, other lentivirus vectors 
such as those based on the human immunodeficiency virus (HIV), adenovirus vectors, 
adeno-associated virus vectors, herpesvirus vectors, vaccinia virus vectors, and the like (see 
Miller and Rosman, BioTechniques 7:980-990, 1992; Anderson et al., Nature 392:25-30 
Suppl., 1998; Verma and Somia, Nature 389:239-242, 1997; Wilson, New Engl. J. Med. 
334:1 185-1 187 (1996), each of which is incorporated herein by reference). 

[0161] When retroviruses, for example, are used for gene transfer, replication competent 
retroviruses theoretically can develop due to recombination of retroviral vector and viral 
gene sequences in the packaging cell line utilized to produce the retroviral vector! 
Packaging cell lines in which the production of replication competent virus by 
recombination has been reduced or eliminated can be used to minimize the likelihood that a 
replication competent retrovirus will be produced. All retroviral vector supernatants used to 
infect cells are screened for replication competent virus by standard assays such as PCR and 
reverse transcriptase assays. Retroviral vectors allow for integration of a heterologous gene 
into a host cell genome, which allows for the gene to be passed to daughter cells following 
cell division. 

[0162] A polynucleotide, which can be contained in a vector, can be introduced into a 
cell by any of a variety of methods known in the art (Sambrook et al., Molecular Cloning: 
A laboratory manual (Cold Spring Harbor Laboratory Press 1989); Ausubel et al., Current 
Protocols in Molecular Biology, John Wiley and Sons, Baltimore, MD (1987, and 
supplements through 1995), each of which is incorporated herein by reference). Such 
methods include, for example, transfection, lipofection, microinjection, electroporation and, 
with viral vectors, infection; and can include the use of liposomes, microemulsions or the 
like, which can facilitate introduction of the polynucleotide into the cell and can protect the 
polynucleotide from degradation prior to its introduction into the cell. The selection of a 
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particular method will depend, for example, on the cell into which the polynucleotide is to 
be introduced, as well as whether the cell is isolated in culture, or is in a tissue or organ in 
culture or in situ, 

[0163] Introduction of a polynucleotide into a cell by infection with a viral vector is 
particularly advantageous in that it can efficiently introduce the nucleic acid molecule into a 
cell ex vivo or in vivo (see, for example, U.S. Patent No. 5,399,346, which is incorporated 
herein by reference). Moreover, viruses are very specialized and can be selected as vectors 
based on an ability to infect and propagate in one or a few specific cell types. Thus, their 
natural specificity can be used to target the nucleic acid molecule contained in the vector to 
specific cell types. As such, a vector based on an HIV can be used to infect T cells, a vector 
based on an adenovirus can be used, for example, to infect respiratory epithelial cells, a 
vector based on a herpesvirus can be used to infect neuronal cells, and the like. Other 
vectors, such as adeno-associated viruses can have greater host cell range and, therefore, 
can be used to infect various cell types, although viral or non-viral vectors also can be 
modified with specific receptors or ligands to alter target specificity through receptor 
mediated events. 

[0164] Thus, a polynucleotide of the invention can be a naturally occurring, synthetic, or 
intentionally manipulated polynucleotide. For example, portions of the mRNA sequence 
can be altered due to alternate RNA splicing patterns or the use of alternate promoters for 
RNA transcription. As another example, the polynucleotide can be subjected to site 
directed mutagenesis. The polynucleotide also can be antisense nucleotide sequence. 
MICAL and MICAL-Like polynucleotides (i.e., polynucleotides that encode a MICAL 
polypeptide or a MICAL-Like polypeptide) of the invention include sequences that are 
degenerate as a result of the genetic code. There are 20 natural amino acids, most of which 
are specified by more than one codon. Therefore, all degenerate nucleotide sequences are 
included within the invention, provided the amino acid sequence of a MICAL polypeptide 
or a MICAL-Like encoded by the polynucleotide is functionally unchanged. 

[0165] Oligonucleotide portions of a polynucleotide encoding a MICAL polypeptide or a 
MICAL-Like polypeptide of the invention also are encompassed within the present 
invention. Such oligonucleotides generally are at least about 15 bases in length, which is 
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sufficient to permit the oligonucleotide to selectively hybridize to a polynucleotide encoding 
the MICAL or MICAL-Like polypeptide, and can be at least about 18 nucleotides or 21 
nucleotides or more in length. As used herein, the term "selective hybridization" or 
"selectively hybridize" refers to hybridization under moderately stringent or highly stringent 
physiological conditions, which can distinguish related nucleotide sequences from unrelated 
nucleotide sequences. 

[0166] In nucleic acid hybridization reactions, the conditions used to achieve a particular 
level of stringency will vary, depending on the nature of the nucleic acids being hybridized. 
For example, the length, degree of complementarity, nucleotide sequence composition (for 
example, relative GC:AT content), and nucleic acid type, i.e., whether the oligonucleotide 
or the target nucleic acid sequence is DNA or RNA, can be considered in selecting 
hybridization conditions. An additional consideration is whether one of the nucleic acids is 
immobilized, for example, on a filter. Methods for selecting appropriate stringency 
conditions can be determined empirically or estimated using various formulas, and are well 
known in the art (see, for example, Sambrook et al., supra, 1989). 

[0167] An example of progressively higher stringency conditions is as follows: 
2X SSC/0.1% SDS at about room temperature (hybridization conditions); 0.2X SSC/0.1% 
SDS at about room temperature (low stringency conditions); 0.2X SSC/0.1% SDS at about 
42°C (moderate stringency conditions); and 0.1X SSC at about 68°C (high stringency 
conditions). Washing can be carried out using only one of these conditions, for example, 
high stringency conditions, or each of the conditions can be used, for example, for 10 to 15 
minutes each, in the order listed above, repeating any or all of the steps listed. 

[0168] A MICAL or MICAL-Like polypeptide-encoding polynucleotide of the invention 
can be obtained by any of several methods. For example, the polynucleotide can be isolated 
using hybridization or computer based techniques, as are well known in the art. These 
methods include, but are not limited to, 1) hybridization of genomic or cDNA libraries with 
probes to detect homologous nucleotide sequences; 2) antibody screening of expression 
libraries to detect cloned DNA fragments with shared structural features; 3) polymerase 
chain reaction (PGR) on genomic DNA or cDNA using primers capable of annealing to the 
DNA sequence of interest; 4) computer searches of sequence databases for similar 
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sequences (see above); 5) differential screening of a subtracted DNA library; and 6) two 
hybrid assays using, for example, a MICAL polypeptide in one of the hybrids. 

[0169] A polynucleotide of the invention, for example, a polynucleotide encoding a 
MICAL, can be derived from a vertebrate species, including a mammalian, avian, or piscine 
species, or from an invertebrate species. Screening procedures that rely on nucleic acid 
hybridization allow the isolation any gene sequence from any organism, provided the 
appropriate probe is available. Oligonucleotide probes that correspond to a part of the 
sequence encoding the protein in question can be synthesized chemically. Hybridization is 
particularly useful in the detection of cDNA clones derived from sources where an 
extremely low amount of mRNA sequences relating to the polypeptide of interest are 
present. Thus, by using stringent hybridization conditions directed to avoid nonspecific 
binding, autoradiographic visualization can be used to identify a specific cDNA clone by 
the hybridization of the target DNA to an oligonucleotide probe in the mixture that is the 
complete complement of the target nucleic acid (Wallace et al., Nucl. Acid Res., 9:879, 
1981, which is incorporated herein by reference). Alternatively, a subtractive library can be 
used, thereby eliminating nonspecific cDNA clones. 

[0170] When the entire amino acid sequence of a desired polypeptide is not known, the 
direct synthesis of DNA sequences is not possible and the method of choice is the synthesis 
of cDNA sequences. Among the standard procedures for isolating cDNA sequences of 
interest is the formation of cDNA libraries prepared in plasmids or phage, wherein the 
libraries are derived from reverse transcription of mRNA that is abundant in donor cells 
having a high level of genetic expression. When used in combination with polymerase 
chain reaction technology, even rare expression products can be cloned. Where significant 
portions of the amino acid sequence of the polypeptide are known, the production of labeled 
single stranded or double stranded DNA or RNA probe sequences duplicating a sequence 
putatively present in the target cDNA can be employed in hybridization procedures carried 
out on cloned copies of the cDNA, which have been denatured into a single stranded form 
(Jay et aL, Nucl Acid Res., 1 1 :2325, 1983, which is incorporated herein by reference). 

[0171] A cDNA expression library, such as a lambda gtl 1 library, can be screened for 
MICAL or MICAL-Like polypeptides using an antibody specific for a MICAL or MICAL- 
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Like polypeptide. The antibody can be polyclonal or monoclonal, and can be used to detect 
expression product indicative of the presence of a MICAL or MICAL-Like polypeptide 
encoding cDNA. Such an expression library also can be screened with a MICAL or 
MICAL-Like polypeptide to identify a clone encoding at least a portion of a MICAL or 
MICAL-Like polypeptide binding domain of a plexin or other protein that interacts with 
MICAL or the MICAL-Like polypeptide. 

[0172] Polynucleotides encoding mutant MICAL and MICAL-Like polypeptides are 
also encompassed within the invention. An alteration in a polynucleotide encoding a 
MICAL or MICAL-Like protein can be an intragenic mutation such as point mutation, 
nonsense (STOP) mutation, missense mutation, splice site mutation or frameshift, or can be 
a heterozygous or homozygous deletion, and can be a naturally occurring mutation or can be 
engineered using recombinant DNA methods, for example. Such alterations can be detected 
using standard methods known to those of skill in the art, including, but not limited to, 
nucleotide sequence analysis, Southern blot analysis, a PGR based analysis such as 
multiplex PCR or sequence tagged sites (STS) analysis, or in situ hybridization analysis. 
MICAL and MICAL-Like polypeptides can be analyzed by standard SDS-PAGE, 
immunoprecipitation analysis, western blot analysis, or the like. 

[0173] A polynucleotide encoding a MICAL or a MICAL-Like polypeptide can be 
expressed in vitro by introducing the polynucleotide into a suitable host cell. "Host cells" 
can be any cells in which the particular vector can be propagated, and, where appropriate, in 
which a polynucleotide contained in the vector can be expressed. The term "host cells" 
includes any progeny of an original host cell. It is understood that all progeny of the host 
cell may not be identical to the parental cell due, for example, to mutations that occur 
during replication. However, such progeny are included when the term "host cell" is used. 
Methods of obtaining a host cell that transiently or stably contains an introduced 
polynucleotide of the invention are well known in the art. In one aspect, the present 
invention provides host cell that includes a polynucleotide encoding a MICAL polypeptide 
according to the present invention, operably linked to a heterologous promoter. 

[0174] In certain aspects of embodiments of the present invention, a cell is a mammalian 
cell, for example a human cell. 
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[0175] A polynucleotide encoding a MICAL or a MICAL-Like polypeptide of the 
invention can be inserted into a vector, which can be a cloning vector or a recombinant 
expression vector. The term "recombinant expression vector" refers to a plasmid, virus or 
other vehicle known in the art that has been manipulated by insertion or incorporation of a 
polynucleotide, particularly, with respect to the present invention, a polynucleotide 
encoding all or a peptide portion of a MICAL or a MICAL-Like polypeptide. Such 
expression vectors contain a promoter sequence, which facilitates the efficient transcription 
of the inserted genetic sequence of the host. The expression vector generally contains an 
origin of replication, a promoter, as well as specific genes which allow phenotypic selection 
of the transformed cells. Vectors suitable for use in the present invention include, but are 
not limited to, the T7-based expression vector for expression in bacteria (Rosenberg, et al., 
Gene 56:125, 1987), the pMSXND expression vector for expression in mammalian cells 
(Lee and Nathans, J. Biol Chem. 263:3521, 1988) and baculovirus-derived vectors for 
expression in insect cells. The DNA segment can be present in the vector operably linked to 
regulatory elements, for example, a promoter, which can be a T7 promoter, metallothionein 
I promoter, polyhedrin promoter, or other promoter as desired, particularly tissue specific 
promoters or inducible promoters. 

[0176] A polynucleotide sequence encoding a MICAL or a MICAL-Like polypeptide 
can be expressed in either prokaryotes or eukaryotes. Hosts can include microbial, yeast, 
insect and mammalian organisms. Methods of expressing polynucleotides having 
eukaryotic or viral sequences in prokaryotes are well known in the art, as are biologically 
functional viral and plasmid DNA vectors capable of expression and replication in a host. 
Methods for constructing an expression vector containing a polynucleotide of the invention 
are well known, as are factors to be considered in selecting transcriptional or translational 
control signals, including, for example, whether the polynucleotide is to be expressed 
preferentially in a particular cell type or under particular conditions (see, for example, 
Sambrook et al., supra, 1989). 

[0177] A variety of host cell/expression vector systems can be utilized to express a 
MICAL or a MICAL-Like polypeptide coding sequence, including, but not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage DNA, 
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plasmid DNA or cosmid DNA expression vectors; yeast cells transformed with recombinant 
yeast expression vectors; plant cell systems infected with recombinant virus expression 
vectors such as a cauliflower mosaic virus or tobacco mosaic virus, or transformed with 
recombinant plasmid expression vector such as a Ti plasmid; insect cells infected with 
recombinant virus expression vectors such as a baculovirus; animal cell systems infected 
with recombinant virus expression vectors such as a retrovirus, adenovirus or vaccinia virus ■ 
vector; and transformed animal cell systems genetically engineered for stable expression. 
Where the expressed MICAL or a MICAL-Like polypeptide is post-translationally 
modified, for example, by glycosylation, it can be particularly advantageous to select a host 
cell/expression vector system that can effect the desired modification, for example, a 
mammalian host cell/expression vector system. 

[0178] Depending on the host cell/vector system utilized, any of a number of suitable 
transcription and translation elements, including constitutive and inducible promoters, 
transcription enhancer elements, transcription terminators, and the like can be used in the 
expression vector (Bitter et al., Meth. EnzymoL 153:516-544, 1987). For example, when 
cloning in bacterial systems, inducible promoters such as pL of bacteriophage X, plac, ptrp, 
ptac (ptrp -lac hybrid promoter) and the like can be used. When cloning in mammalian cell 
systems, promoters derived from the genome of mammalian cells, for example, a human or 
mouse metallothionein promoter, or from mammalian viruses, for example, a retrovirus 
long terminal repeat, an adenovirus late promoter or a vaccinia virus 7.5K promoter, can be 
used. Promoters produced by recombinant DNA or synthetic techniques can also be used to 
provide for transcription of the inserted MICAL or a MICAL-Like polypeptide coding 
sequence. 

[0179] In yeast cells, a number of vectors containing constitutive or inducible promoters 
can be used (see Ausubel et al., supra, 1987, see chapter 13; Grant et al., Meth. EnzymoL 
153:516-544, 1987; Glover, DNA Cloning Vol. II (IRL Press, 1986), see chapter 3; Bitter, 
Meth. EnzymoL 152:673-684, 1987; see, also, The Molecular Biology of the Yeast 
Saccharomyces (Eds., Strathern et al., Cold Spring Harbor Laboratory Press, 1982), Vols. I 
and II). A constitutive yeast promoter such as ADH or LEU2 or an inducible promoter such 
as GAL can be used (Rothstein, DNA Cloning Vol. II {supra, 1986), chapter 3). 
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Alternatively, vectors can be used which promote integration of foreign DNA sequences 
into the yeast chromosome. 

[0180] Eukaryotic systems, particularly mammalian expression systems, allow for 
proper post-translational modifications of expressed mammalian proteins. Eukaryotic cells 
which possess the cellular machinery for proper processing of the primary transcript, 
glycosylation, phosphorylation, and advantageously, plasma membrane insertion of the 
gene product can be used as host cells for the expression of a MICAL or MICAL-Like 
polypeptide, or functional peptide portion thereof. 

[0181] Mammalian cell systems which utilize recombinant viruses or viral elements to 
direct expression can be engineered. For example, when using adenovirus expression 
vectors, the MICAL or MICAL-Like polypeptide coding sequence can be ligated to an 
adenovirus transcription/translation control complex, e.g., the late promoter and tripartite 
leader sequence. Alternatively, the vaccinia virus 7.5K promoter can be used (Mackett et 
al., Proc. Natl Acad. Set, USA 79:7415-7419, 1982; Mackett et al., J. Virol 49:857-864, 
1984; Panicali et al., Proc. Natl Acad. Set, USA 79:4927-4931, 1982). Particularly useful 
are bovine papilloma virus vectors, which can replicate as extrachromosomal elements 
(Sarver et al., Mol Cell Biol 1 :486, 1981). Shortly after entry of this DNA into mouse 
cells, the plasmid replicates to about 100 to 200 copies per cell. Transcription of the 
inserted cDNA does not require integration of the plasmid into the host cell chromosome, 
thereby yielding a high level of expression. These vectors can be used for stable expression 
by including a selectable marker in the plasmid, such as, for example, the neo gene. 
Alternatively, the retroviral genome can be modified for use as a vector capable of 
introducing and directing the expression of the MICAL or MICAL-Like gene in host cells 
(Cone and Mulligan, Proc. Natl Acad. Scl f USA 81:6349-6353, 1984). High level 
expression can also be achieved using inducible promoters, including, but not limited to, the 
metallothionein HA promoter and heat shock promoters. 

[0182] For long term, high yield production of recombinant proteins, stable expression is 
preferred. Rather than using expression vectors which contain viral origins of replication, 
host cells can be transformed with the MICAL or a MICAL-Like polypeptide encoding 
cDNA controlled by appropriate expression control elements such as promoter, enhancer, 
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sequences, transcription terminators, and polyadenylation sites, and a selectable marker. 
The selectable marker in the recombinant plasmid can confer resistance to the selection, and 
allows cells to stably integrate the plasmid into their chromosomes and grow to form foci, 
which, in turn can be cloned and expanded into cell lines. For example, following the 
introduction of foreign DNA, engineered cells can be allowed to grow for 1 to 2 days in an 
enriched media, and then are switched to a selective media. A number of selection systems 
can be used, including, but not limited to, the herpes simplex virus thymidine kinase 
(Wigler et al., Cell 1 1 :223, 1977), hypoxanthine-guanine phosphoribosyltransferase 
(Szybalska and Szybalski, Proc. Natl Acad. Set, USA 48:2026, 1982), and adenine 
phosphoribosyltransferase (Lowy, et al., Cell 22:817, 1980) genes can be employed in tk", 
hgprt" or aprf cells respectively. Also, antimetabolite resistance can be used as the basis of 
selection for dhfr, which confers resistance to methotrexate (Wigler, et al., Proc, Natl Acad. 
Set USA 77:3567, 1980; O'Hare et al., Proc. Natl. Acad. Sci., USA 78: 1527, 1981); gpt, 
which confers resistance to mycophenolic acid (Mulligan and Berg, Proc. Natl. Acad. Sci., 
USA 78:2072, 1981); neo, which confers resistance to the aminoglycoside G-418 
(Colberre-Garapin et al., J. Mol Biol 150:1, 1981); and hygro, which confers resistance to 
hygromycin (Santerre et al., Gene 30:147, 1984) genes. Additional selectable genes, 
including trpB, which allows cells to utilize indole in place of tryptophan; hisD, which 
allows cells to utilize histinol in place of histidine (Hartman and Mulligan, Proc. Natl. 
Acad. Sci., USA 85:8047, 1988); and ODC (ornithine decarboxylase) which confers 
resistance to the ornithine decarboxylase inhibitor, 2-(difluoromethyl)-DL-ornithine, DFMO 
(McConlogue, Curr. Comm. Mol. Biol. (Cold Spring Harbor Laboratory Press, 1987), also 
have been described. 

[0183] When the host is a eukaryote, such methods of transfection of DNA as calcium 
phosphate coprecipitates, conventional mechanical procedures such as microinjection, 
electroporation, insertion of a plasmid encased in liposomes, or virus vectors can be used. 
Eukaryotic cells can also be cotransformed with DNA sequences encoding the MICAL or a 
MICAL-Like polypeptides of the invention, and a second foreign DNA molecule encoding 
a selectable phenotype, such as the herpes simplex thymidine kinase gene. Another method 
is to use a eukaryotic viral vector, such as simian virus 40 (SV40) or bovine papilloma 
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virus, to transiently infect or transform eukaryotic cells and express the protein. (Gluzman 5 
Eukaryotic Viral Vectors (Cold Spring Harbor Laboratory Press, 1982)). 

[0184] The invention also provides stable recombinant cell lines, the cells of which 
express MICAL or a MICAL-Like polypeptides and contain DNA that encodes MICAL or a 
MICAL-Like polypeptides. Suitable cell types include, but are not limited to, NIH 3T3 
cells (murine), C2C12 cells, L6 cells, and P19 cells. C2C12 and L6 myoblasts differentiate 
spontaneously in culture and form myotubes depending on the particular growth conditions 
(Yaffe and Saxel, Nature 270:725-727, 1977; Yaffe, Proc. Natl Acad. Sci., USA 61:477- 
483, 1968). PI 9 is an embryonal carcinoma cell line. Such cells are described, for 
example, in the Cell Line Catalog of the American Type Culture Collection (ATCC). These 
cells can be stably transformed using well known methods (see, for example, Ausubel et al., 
supra, 1995, see sections 9.5.1-9.5.6). 

[0185] A MICAL or a MICAL-Like polypeptide can be expressed from a recombinant 
polynucleotide of the invention using inducible or constitutive regulatory elements, as 
described herein. The desired protein encoding sequence and an operably linked promoter 
can be introduced into a recipient cell either as a non-replicating DNA (or RNA) molecule, 
which can either be a linear molecule or a covalently closed circular molecule. Expression 
of the desired molecule can occur due to transient expression of the introduced sequence, or 
the polynucleotide can be stably maintained in the cell, for example, by integration into a 
host cell chromosome, thus allowing a more permanent expression. Accordingly, the cells 
can be stably or transiently transformed (transfected) cells. 

[0186] An example of a vector that can be employed is one which is capable of 
integrating the desired gene sequences into the host cell chromosome. Cells which have 
stably integrated the introduced DNA into their chromosomes can be selected by also 
introducing one or more markers which allow for selection of host cells which contain the 
expression vector. The marker can complement an auxotrophy in the host such as leu2, or 
ura3, which are common yeast auxotrophic markers; can confer a biocide resistance, for 
example, to an antibiotic or to heavy metal ions such as copper, or the like. The selectable 
marker gene can either be directly linked to the DNA gene sequences to be expressed, or 
can be introduced into the same cell by cotransfection. 
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[0187] The introduced sequence can be incorporated into a plasmid or viral vector 
capable of autonomous replication in the recipient host. Any of a variety of vectors can be 
employed for this purpose. Factors of importance in selecting a particular plasmid or viral 
vector include the ease with which recipient cells that contain the vector can be recognized 
and selected from those cells that do not contain the vector; the number of copies of the 
vector desired in a particular host cell; and whether it is desirable to be able to "shuttle" the 
vector between host cells of different species. 

[0188] For a mammalian host, several vector systems are available for expression. One 
class of vectors utilizes DNA elements that provide autonomously replicating extra- 
chromosomal plasmids derived from animal viruses, for example, a bovine papilloma virus, 
polyoma virus, adenovirus, or SV40 virus. A second class of vectors includes vaccinia 
virus expression vectors. A third class of vectors relies upon the integration of the desired 
gene sequences into the host chromosome. Cells that have stably integrated the introduced 
DNA into their chromosomes can be selected by also introducing one or more markers 
genes (as described above), which allow selection of host cells that contain the expression 
vector. The selectable marker gene can be directly linked to the DNA sequences to be 
expressed, or introduced into the same cell by cotransfection. Additional elements can be 
included to provide for optimal synthesis of an encoded mRNA or peptide, including, for 
example, splice signals, transcription promoters or enhancers, and transcription or 
translation termination signals. cDNA expression vectors incorporating appropriate 
regulatory elements are well known in the art (see, for example, Okayama, Mol. Cell. Biol. 
3:280, 1983). 

[0189] Once the vector or DNA sequence containing the construct has been prepared for 
expression, the DNA construct can be introduced into an appropriate host. Various methods 
can be used for introducing the polynucleotide into a cell, including, for example, methods 
of transfection or transformation such as protoplast fusion, calcium phosphate precipitation, 
and electroporation or other conventional techniques, for example, infection where the 
vector is a viral vector. 

[0190] The invention also provides transgenic non-human animals that have cells that 
constitutively express a recombinant MICAL or a MICAL-Like polypeptide or that have 
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recombinants inactivated MICAL or MICAL-Like function. In certain aspects, transgenic 
animals that constitutively express a recombinant MICAL or MICAL-Like protein can be 
expressed with a tag sequence that can be used to facilitate immunoprecipitation of the 
MICAL or MICAL-Like polypeptide. Such transgenic non-human animals can be used for 
example, to facilitate the identification of agents which bind the MICAL or MICAL-Like 
polypeptides. Alternatively, the transgenic non-human organism can express a mutant 
transgenic MICAL in the central nervous system or peripheral nervous system, to identify 
mutant MICAL polypeptides that affect axonal guidance. 

[0191] Accordingly, in one aspect, the transgenic animal is a transgenic non-human 
organism whose genome includes a transgenic DNA sequence that includes a 
polynucleotide that encodes a mutant MICAL polypeptide operably linked to a promoter 
that is active in the central nervous system and/or peripheral nervous system, wherein the 
mouse expresses the transgenic polynucleotide in the central nervous system and/or 
peripheral nervous system, and wherein expression levels of transgenic polynucleotide are 
sufficient to effect an axonal guidance phenotype of the non-human organism. 

[0192] In another aspect, the transgenic animal is a non-human transgenic animal having 
a genome comprising a transgene comprising a nucleotide sequence encoding a MICAL 
polypeptide operably linked to a heterologous promoter. The non-human transgenic animal 
expresses the transgenic polynucleotide in the central nervous system and/or peripheral 
nervous system at expression levels sufficient to effect an axonal guidance phenotype of the 
non-human organism. In one aspect, the MICAL polypeptide is ectopically expressed. The 
MICAL polypeptide is expressed in the transgenic animal at a greater level in one or more 
cells of the non-human transgenic animal than the MICAL polypeptide is expressed in 
comparable cells of a comparable non-human transgenic animal. 

[0193] In another embodiment, the present invention provides a non-human transgenic 
animal having a genome comprising a recombinantly inactivated nucleotide sequence 
encoding a MICAL polypeptide that has been recombinantly inactivated. The non-human 
transgenic animal has an altered phenotype that results from inactivation of the MICAL 
polypeptide. For example, the altered phenotype can be an altered axon guidance 
phenotype. 
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[0194] The non-human transgenic animal of this aspect of the invention, in certain 
aspects is heterozygous for the nucleotide sequence that has been inactivated. Alternatively, 
the non-human transgenic animal of this aspect of the invention, can be homozygous for the 
nucleotide sequence that has been recombinantly inactivated. 

[0195] As used herein, the term "transgenic," when used in reference to an animal or an 
organism, means that cells of the animal or organism have been genetically manipulated to 
contain an exogenous polynucleotide sequence that is stably maintained with the cells. The 
manipulation can be, for example, microinjection of a polynucleotide or infection with a 
recombinant virus containing the polynucleotide. Thus, the term "transgenic" is used herein 
to refer to animals (organisms) in which one or more cells receive a recombinant 
polynucleotide, which can be integrated into a chromosome in the cell, or can be maintained 
as an extrachromosomally replicating polynucleotide, such as might be engineered into a 
yeast artificial chromosome. The term "transgenic animal" also includes a "germ cell line" 
transgenic animal. A germ cell line transgenic animal is a transgenic animal in which the 
genetic information has been taken up and incorporated into a germ line cell, therefore 
conferring the ability to transfer the information to offspring. If such offspring in fact 
possess some or all of that information, the offspring also are considered to be transgenic 
animals. The invention further encompasses transgenic organisms. 

[0196] A transgenic organism can be any organism whose genome has been altered by in 
vitro manipulation of an early stage embryo or a fertilized egg, or by any transgenic 
technology to induce a specific gene knock-out. The term "gene knock-out" refers to the 
targeted disruption of a gene in a cell or in vivo that results in complete loss of function. 
Gene knock-outs is also referred to herein as inactivated genes, such as recombinantly 
inactivated genes. A target gene in a transgenic animal can be rendered nonfunctional by 
an insertion targeted to the gene to be rendered nonfunctional, for example, by homologous 
recombination, or by any other method for disrupting the function of a gene in a cell. 

[0197] The transgene to be used in the practice of the subject invention can be a DNA 
sequence comprising a modified MICAL or MICAL-Like polypeptide coding sequence. 
Preferably, the modified MICAL or MICAL-Like gene is one that is disrupted by 
homologous targeting in embryonic stem cells. For example, the entire MICAL gene can be 
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deleted (See Examples herein). Optionally, the disruption (or deletion) can be accompanied 
by insertion of or replacement with another polynucleotide, for example, a polynucleotide 
encoding a nonfunctional MICAL or MICAL-Like polypeptide. A "knock-out" phenotype 
also can be conferred by introducing or expressing an antisense MICAL or MICAL-Like 
polypeptide encoding polynucleotide in a cell in the organism, or by expressing an antibody 
or a dominant negative MICAL or MICAL-Like polypeptide in the cells. 

[0198] Various methods are known for producing a transgenic animal. In one method, 
an embryo at the pronuclear stage (a "one cell embryo") is harvested from a female and the 
transgene is microinjected into the embryo, in which case the transgene will be 
chromosomally integrated into the germ cells and somatic cells of the resulting mature 
animal. Li another method, embryonic stem cells are isolated and the transgene is 
incorporated into the stem cells by electroporation, plasmid transfection or microinjection; 
the stem cells are then reintroduced into the embryo, where they colonize and contribute to 
the germ line. Methods for microinjection of polynucleotides into mammalian species are 
described, for example, in U.S. Patent No. 4,873,191, which is incorporated herein by 
reference. In yet another method, embryonic cells are infected with a retrovirus containing 
the transgene, whereby the germ cells of the embryo have the transgene chromosomally 
integrated therein. 

[0199] When the animals to be made transgenic are avian, microinjection into the 
pronucleus of the fertilized egg is problematic because avian fertilized ova generally go 
through cell division for the first twenty hours in the oviduct and, therefore, the pronucleus 
is inaccessible. Thus, the retrovirus infection method is preferred for making transgenic 
avian species (see U.S. Patent No. 5,162,215, which is incorporated herein by reference). If 
microinjection is to be used with avian species, however, the embryo can be obtained from a 
sacrificed hen approximately 2.5 hours after the laying of the previous laid egg, the 
transgene is microinjected into the cytoplasm of the germinal disc and the embryo is 
cultured in a host shell until maturity (Love et al., Biotechnology 12, 1994). When the 
animals to be made transgenic are bovine or porcine, microinjection can be hampered by the 
opacity of the ova, thereby making the nuclei difficult to identify by traditional differential 
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interference-contrast microscopy. To overcome this problem, the ova first can be 
centrifuged to segregate the pronuclei for better visualization. 

[0200] Non-human transgenic animals of the invention can be an invertebrate or a 
vertebrate. For example, the transgenic organism can be Drosophila or a mammal such as a 
mouse or a rat. The transgene can be introduced into embryonal target cells at various 
developmental stages, and different methods are selected depending on the stage of 
development of the embryonal target cell. The zygote is the best target for microinjection. 
The use of zygotes as a target for gene transfer has a major advantage in that the injected 
DNA can incorporate into the host gene before the first cleavage (Brinster et al., Proc. Natl 
Acad. Set, USA 82:4438-4442, 1985). As a consequence, all cells of the transgenic 
non-human animal carry the incorporated transgene, thus contributing to efficient 
transmission of the transgene to offspring of the founder, since 50% of the germ cells will 
harbor the transgene. 

[0201] A transgenic animal can be produced by crossbreeding two chimeric animals, 
each of which includes exogenous genetic material within cells used in reproduction. 
Twenty-five percent of the resulting offspring will be transgenic animals that are 
homozygous for the exogenous genetic material, 50% of the resulting animals will be 
heterozygous, and the remaining 25% will lack the exogenous genetic material and have a 
wild type phenotype. 

[0202] In the microinjection method, the transgene is digested and purified free from any 
vector DNA, for example, by gel electrophoresis. The transgene can include an operatively 
associated promoter, which interacts with cellular proteins involved in transcription, and 
provides for constitutive expression, tissue specific expression, developmental stage specific 
expression, or the like. Such promoters include those from cytomegalovirus (CMV), 
Moloney leukemia virus (MLV), and herpes virus, as well as those from the genes encoding 
metallothionein, skeletal actin, Phosphenolpyruvate carboxylase (PEPCK), 
phosphoglycerate (PGK), dihydrofolate reductase (DHFR), and thymidine kinase (TK). 
Promoters from viral long terminal repeats (LTRs) such as Rous sarcoma virus LTR also 
can be employed. When the animals to be made transgenic are avian, preferred promoters 
include those for the chicken P-globin gene, chicken lysozyme gene, and avian leukosis 
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virus. Constructs useful in plasmid transfection of embryonic stem cells will employ 
additional regulatory elements, including, for example, enhancer elements to stimulate 
transcription, splice acceptors, termination and polyadenylation signals, ribosome binding 
sites to permit translation, and the like. 

[0203] In the retroviral infection method, the developing non-human embryo can be 
cultured in vitro to the blastocyst stage. During this time, the blastomeres can be targets for 
retroviral infection (Jaenich, Proc. Natl. Acad. Sci, USA 73:1260-1264, 1976). Efficient 
infection of the blastomeres is obtained by enzymatic treatment to remove the zona 
pellucida (Hogan et al., Manipulating the Mouse Embryo (Cold Spring Harbor Laboratory 
Press, 1986). The viral vector system used to introduce the transgene is typically a 
replication-defective retrovirus carrying the transgene (Jahner et al, Proc. Natl. Acad. Sci., 
USA 82:6927-6931, 1985; Van der Putten et al., Proc. Natl. Acad. Sci, USA 82:6148-6152, 
1985). Transfection is easily and efficiently obtained by culturing the blastomeres on a 
monolayer of virus producing cells (Van der Putten et al., supra, 1985; Stewart et al., EMBO 
J. 6:383-388, 1987). Alternatively, infection can be performed at a later stage. Virus or 
virus-producing cells can be injected into the blastocoele (Jahner et al, Nature 298:623-628, 
1982). Most of the founders will be mosaic for the transgene since incorporation occurs 
only in a subset of the cells which formed the transgenic nonhuman animal. Further, the 
founder can contain various retroviral insertions of the transgene at different positions in the 
genome, which generally will segregate in the offspring. In addition, it is also possible to 
introduce transgenes into the germ line, albeit with low efficiency, by intrauterine retroviral 
infection of the mid-gestation embryo (Jahner et al., supra, 1982). 

[0204] Embryonal stem cell (ES) also can be targeted for introduction of the transgene. 
ES cells are obtained from pre-implantation embryos cultured in vitro and rased with 
embryos (Evans et al. Nature 292:154-156, 1981; Bradley et al., Nature 309:255-258, 1984; 
Gossler et al, Proc. Natl. Acad. Sci., USA 83:9065-9069, 1986; Robertson et al., Nature 
322:445-448, 1986). Transgenes can be efficiently introduced into the ES cells by DNA 
transfection or by retrovirus mediated transduction. Such transformed ES cells can 
thereafter be combined with blastocysts from a nonhuman animal. The ES cells thereafter 
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colonize the embryo and contribute to the germ line of the resulting chimeric animal (see 
Jaenisch, Science 240:1468-1474, 1988). 

[0205] The present invention also provides an antibody or antigen binding fragment 
thereof that specifically bind a MICAL polypeptide or a MICAL-Like polypeptide, or a 
functional peptide portion thereof. Particularly useful antibodies of the invention include 
antibodies that specifically bind a plexin interacting region of a MICAL, thereby inhibiting 
binding of the MICAL to plexins. Such antibodies can be useful, for example, for inhibiting 
semaphorin-mediate axonal repulsion and thus stimulating, for example, regeneration of 
axon connections after spinal cord injury. In certain aspects, the present invention provides 
an antibody or antigen binding fragment that binds an N-terminal MICAL domain, a 
MICAL calponin homology domain, a MICAL LIM domain, a MICAL proline rich region, 
or a MICAL plexin-interacting region. 

[0206] A monoclonal antibody that binds specifically to a MICAL or MICAL-Like 
polypeptide can be used to treat a pathological condition involving, for example failure of 
axon regrowth. In a preferred embodiment, the MICAL or MICAL-Like polypeptide 
antibody is administered to a patient by intravenous, intramuscular subcutaneous injection, 
or direct injection to a site of spinal cord damage. A monoclonal antibody can be 
administered, for example, within a dose range between about 0.1 jag/kg to about 100 
mg/kg; more preferably between about 1 ]Ug/kg to 75 mg/kg; most preferably from about 10 
mg/kg to 50 mg/kg. The antibody can be administered, for example, by bolus injunction or 
by slow infusion. Slow infusion over a period of 30 minutes to 2 hours is preferred. The 
anti- MICAL or anti-MICAL-Like polypeptide antibody, can be formulated in a formulation 
suitable for administration to a patient. Such formulations are known in the art. 

[0207] The dosage regimen will be determined by the attending physician considering 
various factors which modify the action of the MICAL or MICAL-Like polypeptide protein 
or the plexin protein, for example, amount of tissue desired to be formed, the site of tissue 
damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue, 
the patient's age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. The dosage can vary with the type of matrix used in the reconstitution 
and the types of agent, such as anti-MICAL or anti-MICAL-Like polypeptide antibodies, to 
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be used in the composition. Generally, systemic or injectable administration, such as 
intravenous, intramuscular, subcutaneous injection, or injection to a site of damage of the 
nervous system is employed. Administration generally is initiated at a dose which is 
minimally effective, and the dose is increased over a preselected time course until a positive 
effect is observed. Subsequently, incremental increases in dosage are made limiting such 
incremental increases to such levels that produce a corresponding increase in effect, while 
taking into account any adverse affects that can appear. The addition of other agents that 
promote neuron process regrowth, can also affect the dosage. 

[0208] As used herein, the term "antibody" is used in its broadest sense to include 
polyclonal and monoclonal antibodies, as well as antigen binding fragments of such 
antibodies. An antibody useful in a method of the invention, or an antigen binding fragment 
thereof, is characterized, for example, by having specific binding activity for an epitope of a 
MICAL or MIC AL-Like polypeptide, or a Plexin. In addition, as discussed above, an 
antibody of the invention can be an antibody that specifically binds a peptide portion of a 
MICAL, a MICAL-Like polypeptide, or a plexin, particularly a plexin-interacting region of 
a MICAL or a MICAL-binding region of a plexin. 

[0209] The term "binds specifically" or "specific binding activity," when used in 
reference to an antibody means that an interaction of the antibody and a particular epitope 
has a dissociation constant of at least about 1 x 10" 6 , generally at least about 1 x 10" 7 , usually 
at least about 1 x 10" 8 , and particularly at least about 1 x 10~ 9 or 1 x 10" 10 or less. As such, 
Fab, F(ab') 2 , Fd and Fv fragments of an antibody that retain specific binding activity for an 
epitope of a MICAL or MICAL-Like polypeptide, are included within the definition of an 
antibody. 

[0210] The term "antibody" as used herein includes naturally occurring antibodies as 
well as non-naturally occurring antibodies, including, for example, single chain antibodies, 
chimeric, bifunctional, human, and humanized antibodies, intrabodies (i.e. intracellularly 
expressed antibodies, see e.g., Chen, S. Y., et al., Hum. Gene Then 5:595-601 (1994)), as 
well as antigen-binding fragments thereof. Such non-naturally occurring antibodies can be 
constructed using solid phase peptide synthesis, can be produced recombinantly or can be 
obtained, for example, by screening combinatorial libraries consisting of variable heavy 
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chains and variable light chains (see Huse et al., Science 246:1275-1281 (1989), which is 
incorporated herein by reference). These and other methods of making, for example, 
chimeric, humanized, CDR- grafted, single chain, and Afunctional antibodies are well 
known to those skilled in the art (Winter and Harris, Immunol Today 14:243-246, 1993; 
Ward et al., Nature 341:544-546, 1989; Harlow and Lane, Antibodies: A laboratory manual 
(Cold Spring Harbor Laboratory Press, 1988); Hilyard et al., Protein Engineering: A 
practical approach (IRL Press 1992); Borrabeck, Antibody Engineering, 2d ed. (Oxford 
University Press 1995); each of which is incorporated herein by reference). 

[0211] Antibodies that bind specifically with a MICAL or MICAL-Like polypeptide can 
be raised using the MICAL or MICAL-Like polypeptide, or a fragment thereof, as an 
immunogen. Where such a polypeptide or fragment thereof is non-immunogenic, it can be 
made immunogenic by coupling the hapten to a carrier molecule such as bovine serum 
albumin (BS A) or keyhole limpet hemocyanin (KLH), or by expressing the peptide portion 
as a fusion protein. Various other carrier molecules and methods for coupling a hapten to a 
carrier molecule are well known in the art (see, for example, by Harlow and Lane, supra, 
1988). 

[0212] If desired, a kit incorporating an antibody or other agent useful in a method of the 
invention can be prepared. Such a kit can contain, in addition to the agent, a pharmaceutical 
composition in which the agent can be reconstituted for administration to a subject. The kit 
also can contain, for example, reagents for detecting the antibody, or for detecting specific 
binding of the antibody to a MICAL or MICAL-Like polypeptide. Such detectable reagents 
useful for labeling or otherwise identifying the antibody are described herein and known in 
the art. 

[0213] Methods for raising polyclonal antibodies, for example, in a rabbit, goat, mouse 
or other mammal, are well known in the art (see, for example, Green et al., "Production of 
Polyclonal Antisera," in Immunochemical Protocols (Manson, ed., Humana Press 1992), 
pages 1-5; Coligan et al., "Production of Polyclonal Antisera in Rabbits, Rats, Mice and 
Hamsters," in Curr. Protocols Immunol (1992), section 2.4.1; each or which is incorporated 
herein by reference). In addition, monoclonal antibodies can be obtained using methods 
that are well known and routine in the art (Harlow and Lane, supra, 1988). For example, 
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spleen cells from a mouse immunized with a MICAL or MICAL-Like polypeptide, or an 
epitopic fragment thereof, can be fused to an appropriate myeloma cell line such as SP/02 
myeloma cells to produce hybridoma cells. Cloned hybridoma cell lines can be screened 
using labeled antigen to identify clones that secrete monoclonal antibodies having the 
appropriate specificity, and hybridomas expressing antibodies having a desirable specificity 
and affinity can be isolated and utilized as a continuous source of the antibodies. The 
antibodies can be further screened for the inability to bind specifically with the MICAL or 
MICAL-Like polypeptide. Such antibodies are useful, for example, for preparing 
standardized kits for clinical use. A recombinant phage that expresses, for example, a single 
chain anti- MICAL or MICAL-Like polypeptide antibody also provides an antibody that can 
used for preparing standardized kits. 

[0214] Methods of preparing monoclonal antibodies well known (see, for example, 
Kohler and Milstein, Nature 256:495, 1975, which is incorporated herein by reference; see, 
also, Coligan et al., supra, 1992, see sections 2.5.1-2.6.7; Harlow and Lane, supra, 1988). 
Briefly, monoclonal antibodies can be obtained by injecting mice with a composition 
comprising an antigen, verifying the presence of antibody production by removing a serum 
sample, removing the spleen to obtain B lymphocytes, fusing the B lymphocytes with 
myeloma cells to produce hybridomas, cloning the hybridomas, selecting positive clones 
that produce antibodies to the antigen, and isolating the antibodies from the hybridoma 
cultures. 

[0215] Monoclonal antibodies can be isolated and purified from hybridoma cultures by a 
variety of well established techniques, including, for example, affinity chromatography with 
Protein-A SEPHAROSE, size exclusion chromatography, and ion exchange 
chromatography (Coligan et al., supra, 1992, see sections 2.7.1-2.7.12 and sections 
2.9.1-2.9.3; see, also, Barnes et al., "Purification of Immunoglobulin G (IgG)," in Meth. 
:Molec. Biol 10:79-104 (Humana Press 1992), which is incorporated herein by reference). 
Methods of in vitro and in vivo multiplication of monoclonal antibodies is well known to 
those skilled in the art. Multiplication in vitro can be carried out in suitable culture media 
such as Dulbecco's Modified Eagle Medium or RPMI 1640 medium, optionally replenished 
by a mammalian serum such as fetal calf serum or trace elements and growth sustaining 



WO 03/066821 



PCT/US03/03551 



65 



supplements such as normal mouse peritoneal exudate cells, spleen cells, bone marrow 
macrophages. Production in vitro provides relatively pure antibody preparations and allows 
scale-up to yield large amounts of the desired antibodies. Large scale hybridoma cultivation 
can be carried out by homogenous suspension culture in an airlift reactor, in a continuous 
stirrer reactor, or in immobilized or entrapped cell culture. Multiplication in vivo can be 
carried out by injecting cell clones into mammals histocompatible with the parent cells, for 
example, syngeneic mice, to cause growth of antibody producing tumors. Optionally, the 
animals are primed with a hydrocarbon, especially oils such as pristane 
(tetramethylpentadecane) prior to injection. After one to three weeks, the desired 
monoclonal antibody is recovered from the body fluid of the animal. 

[0216] Therapeutic applications for antibodies disclosed herein are also part of the 
present invention. For example, antibodies of the present invention can also be derived 
from subhuman primate antibody. General techniques for raising therapeutically useful 
antibodies in baboons can be found, for example, in Goldenberg et al., International Patent 
Publication WO 91/1 1465 (1991); and Losman et al., Int. J. Cancer 46:3 10, 1990, each of 
which is incorporated herein by reference. Accordingly, the present invention provides 
antibodies conjugated to a therapeutic moiety. For example, in certain aspects the present 
invention provides an anti-MICAL antibody conjugated to a monooxygenase inhibitor, for 
example EGCG. 

[021 7] A therapeutically useful anti- MICAL or MICAL-Like polypeptide antibody also 
can be derived from a "humanized" monoclonal antibody. Humanized monoclonal 
antibodies are produced by transferring mouse complementarity determining regions from 
heavy and light variable chains of the mouse immunoglobulin into a human variable 
domain, and then substituting human residues in the framework regions of the murine 
counterparts. The use of antibody components derived from humanized monoclonal 
antibodies obviates potential problems associated with the immunogenicity of murine 
constant regions. General techniques for cloning murine immunoglobulin variable domains 
are known (see, for example, Orlandi et al., Proc. Natl. Acad. Set, USA 86:3833, 1989, 
which is hereby incorporated in its entirety by reference). Techniques for producing 
humanized monoclonal antibodies also are known (see, for example, Jones et al., Nature 
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321:522, 1986; Riechmann et al., Nature 332:323, 1988; Verhoeyen et al., Science 
239:1534, 1988; Carter et al., Proc. Natl Acad. ScL, USA 89:4285, 1992; Sandhu, Crit. Rev. 
Biotechnol 12:437, 1992; and Singer et al., J. Immunol 150:2844, 1993; each of which is 
incorporated herein by reference). 

[0218] Antibodies of the invention also can be derived from human antibody fragments 
isolated from a combinatorial immunoglobulin library (see, for example, Barbas et al., 
METHODS: A Companion to Methods in Immunology 2:1 19, 1991 ; Winter et al., Ann. Rev. 
Immunol. 12:433, 1994; each of which is incorporated herein by reference). Cloning and 
expression vectors that are useful for producing a human immunoglobulin phage library can 
be obtained, for example, from STRATAGENE Cloning Systems (La Jolla, CA). 

[0219] An antibody of the invention also can be derived from a human monoclonal 
antibody. Such antibodies are obtained from transgenic mice that have been "engineered" 
to produce specific human antibodies in response to antigenic challenge. In this technique, 
elements of the human heavy and light chain loci are introduced into strains of mice derived 
from embryonic stem cell lines that contain targeted disruptions of the endogenous heavy 
and light chain loci. The transgenic mice can synthesize human antibodies specific for 
human antigens, and the mice can be used to produce human antibody-secreting 
hybridomas. Methods for obtaining human antibodies from transgenic mice are described, 
for example, by Green et al., Nature Genet. 7:13, 1994; Lonberg et al., Nature 368:856, 
1994; and Taylor et al., Int. Immunol. 6:579, 1994; each of which is incorporated herein by 
reference. 

[0220] Antibody fragments of the present invention can be prepared by proteolytic 
hydrolysis of the antibody or by expression in E. coli of DNA encoding the fragment. 
Antibody fragments can be obtained by pepsin or papain digestion of whole antibodies by 
conventional methods. For example, antibody fragments can be produced by enzymatic 
cleavage of antibodies with pepsin to provide a 5S fragment denoted F(ab f )2. This fragment 

can be further cleaved using a thiol reducing agent, and optionally a blocking group for the 
sulfhydryl groups resulting from cleavage of disulfide linkages, to produce 3.5S Fab' 
monovalent fragments. Alternatively, an enzymatic cleavage using pepsin produces two 
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monovalent Fab 1 fragments and an Fc fragment directly (see, for example, Goldenberg, U.S. 
Patent No. 4,036,945 and U.S. Patent No. 4,331,647, each of which is incorporated by 
reference, and references contained therein; Nisonhoff et al., Arch Biochem. Biophys. 
89:230. 1960; Porter, Biochem. J. 73:119, 1959; Edelman et al., Meth. EnzymoL, 1:422 
(Academic Press 1967), each of which is incorporated herein by reference; see, also, 
Coligan et al., supra, 1992, see sections 2.8.1-2.8.10 and 2.10.1-2.10.4). 

[0221] Other methods of cleaving antibodies, such as separation of heavy chains to form 
monovalent light/heavy chain fragments, further cleavage of fragments, or other enzymatic, 
chemical, or genetic techniques can also be used, provided the fragments specifically bind 
to the antigen that is recognized by the intact antibody. For example, Fv fragments 
comprise an association of V H and V L chains. This association can be noncovalent (Inbar 
et al., Proc. Natl Acad. Set, USA 69:2659, 1972). Alternatively, the variable chains can be 
linked by an intermolecular disulfide bond or cross-linked by chemicals such as 
glutaraldehyde ( Sandhu, supra, 1992). Preferably, the Fv fragments comprise V H and V L 
chains connected by a peptide linker. These single-chain antigen binding proteins (sFv) are 
prepared by constructing a structural gene comprising DNA sequences encoding the V H and 
V L domains connected by an oligonucleotide. The structural gene is inserted into an 
expression vector, which is subsequently introduced into a host cell such as E. coli. The 
recombinant host cells synthesize a single polypeptide chain with a linker peptide bridging 
the two V domains. Methods for producing sFvs are described, for example, by Whitlow et 
al., Methods: A Companion to Methods in Enzymology 2:97, 1991; Bird et al., Science 
242:423-426, 1988; Ladner et al., U.S. patent No. 4,946,778; Pack et al., Bio/Technology 
1 1:1271-1277, 1993; each of which is incorporated herein by reference; see, also Sandhu, 
supra, 1992. 

[0222] Another form of an antibody fragment is a peptide coding for a single 
complementarity-determining region (CDR). CDR peptides ("minimal recognition units") 
can be obtained by constructing genes encoding the CDR of an antibody of interest. Such 
genes are prepared, for example, by using the polymerase chain reaction to synthesize the 
variable region from RNA of antibody-producing cells (see, for example, Larrick et al., 
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Methods: A Companion to Methods in Enzymology 2:106, 1991, which is incorporated 
herein in its entirety by reference). 

[0223] An intrabody comprises at least a portion of an antibody that is capable of 
immunospecifically binding an antigen and preferably does not contain sequences coding 
for its secretion. Such antibodies will bind antigen intracellularly. In one embodiment, the 
intrabody comprises a single-chain Fv ("sFv"). sFvs are antibody fragments comprising the 
VH and VL domains of antibody, wherein these domains are present in a single polypeptide 
chain. Generally, the sFv polypeptide further comprises a polypeptide linker between the 
VH and VL domains which enables the sFv to form the desired structure for antigen 
binding. For a review of sFvs see Pluckthun in The Pharmacology of Monoclonal 
Antibodies, vol. 1 13, Rosenburg and Moore eds. Springer-Verlag, New York, pp. 269-315 
(1994). In a further embodiment, the intrabody preferably does not encode an operable 
secretory sequence and thus remains within the cell (see generally Marasco, WA, 1998, 
"Intrabodies: Basic Research and Clinical Gene Therapy Applications" SpringeriNew 
York). Generation of intrabodies is well-known to the skilled artisan and is described, for 
example, in U.S. Patent Nos. 6,004,940; 6,072,036; 5,965,371, which are incorporated by 
reference in their entireties herein. Further, the construction of intrabodies is discussed in 
Ohage and Steipe, 1999, J, Mol Biol 291:1119-1128; Ohage et al., 1999, J. Mol Biol 
291 : 1 129-1 134; and Wirtz and Steipe, 1999, Protein Science 8:2245-2250, which references 
are incorporated herein by reference in their entireties. 

[0224] In another embodiment, the present invention provides a double-stranded RNA 
molecule that includes a first RNA strand that specifically hybridizes to an mRNA encoding 
a MICAL polypeptide. The double-stranded RNA molecule also includes a second RNA 
strand that is the reverse complement of the first strand. The double-stranded molecule is 
at least 15 base pairs in length. Double stranded RNA is involved in RNA interference, the 
process by which double-stranded RNA induces the silencing of homologous endogenous 
genes. (Hammond, S. M., et al., Nat Rev. Genet. 2(2): 110-9 (2001)). Accordingly, double 
stranded RNA of this embodiment inhibit expression of a MICAL polypeptide, thereby 
promoting axonal growth and target formation. 
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[0225] In another embodiment the present invention provides a method for identifying 
an agent that affects an activity of a MICAL or MICAL-Like protein. As such, the present 
invention provides screening methods for agents that affect MICAL protein or MICAL-Like 
protein activity. The method typically includes contacting a MICAL polypeptide, or a 
functional portion thereof or a MICAL-Like polypeptide, or a functional portion thereof, or 
a cell expressing at least one of these polypeptides, with a candidate agent, and determining 
whether the agent affects an activity of the polypeptide. The activity of the MICAL protein, 
can be any of the activities identified herein for a MICAL polypeptide. 

[0226] For example, a method according to this embodiment can identify an agent that 
affects MICAL monooxygenase activity or plexin interacting activity. Methods for 
identifying monooxygenase activity and plexin interacting activity are known in the art. 
Examples of these methods are provided herein. For example, an immunoprecipitation 
experiment can be performed in the presence of plexA and a MICAL polypeptide, wherein 
the plexA and/or MICAL polypeptide are contacted with an on-test agent. It can then be 
determined whether the agent affected binding of PlexA and the MICAL polypeptide. 
Agents that affect binding are candidate agents for treatment of disorders such as spinal 
cord injury, since they are expected to inhibit semaphorin-mediated axonal repulsion. 

[0227] In one aspect of this embodiment, the method can identify an agent that affects a 
semaphorin-mediated process. That is, the activity of a MICAL protein can be 
participation, typically regulation, of a semaphorin mediated process. For example, a 
semaphorin-mediated process can be semaphorin-mediated axonal repulsion. In particular 
embodiments, the semaphorin-mediated process is mediated by semaphorins 1 A and 3 A, 
4A, or a class 7 semaphorin. 

[0228] In aspects of this embodiment that include a cell, the cell can be virtually any 
cell. Recombinant cells can be produced using standard techniques as disclosed herein, that 
express a MICAL polypeptide using polynucleotides and vectors of the present invention. 
In embodiments where the method identifies an agent that affects a MICAL activity and/or 
a semaphorin-mediated process, the cell, for example, can be a cell of a type that is known 
to include a semaphorin-mediated process, as discussed hereinbelow, such as a cell of the 
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immune system, for example a B cell or a T cell, a cell of neuronal origin, a cell with a 
transformed phenotype, a cardiac cell, or a neural crest precursor cell of a cardiac cell. 

[0229] In another aspect of this embodiment, the present invention provides a method for 
identifying an agent that affects axonal guidance regulatory activity, monooxygenase 
activity, actin binding activity and/or plexin-interacting activity. The method typically 
includes contacting an isolated polypeptide of the present invention that has axonal 
guidance regulatory activity, monooxygenase activity, actin binding activity, and/or plexin- 
interacting activity, or a cell expressing the polypeptide, with a candidate agent. Next, 
axonal guidance regulatory activity, monooxygenase activity, actin binding activity, and/or 
plexin-interacting activity is compared in the presence versus absence of the agent. A 
difference in activity is indicative of an agent that affects axonal guidance regulatory 
activity, monooxygenase activity, and/or plexin-interacting activity. In this aspect of the 
invention, the cell is typically a cell of neuronal original, such as a neuron that 
recombinantly expresses the MICAL. 

[0230] Methods for determining axonal guidance regulatory activity (i.e. axon guidance 
regulatory activity assays), for example semaphorin axonal repulsion activity, are known in 
the art. Some of these methods are disclosed herein. For example, an in vitro method such 
as a rat DRG growth cone repulsion assay using Sema 3A-secreting 293 cells, as disclosed 
above, can be employed (Figure 4A; Messersmith et al., 1995). It will be recognized 
however, that virtually any assay of semaphorin mediated axon repulsion can be employed 
in the methods of the present invention for identifying an agent that affects axonal guidance 
regulatory activity. Methods for assessing plexin-interacting activity and monooxygenase 
activity are provided herein. 

[0231] In another embodiment, the present invention provides a method for screening for 
an agent that modulates an activity of a MICAL or MICAL-Like polypeptide. The method 
includes contacting a cell that recombinantly expresses a MICAL or MICAL-Like 
polypeptide with a candidate agent. Then a phenotypic or physiological trait of the cell is 
compared in the presence or absence of the candidate agent. A difference in the phenotypic 
or physiological trait indicates that the agent modulates the activity of the MICAL 
polypeptide. The phenotypic or physiological trait can involve dynamics of the 
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cytoskeleton, or can be axon guidance, cell proliferation or invasiveness, or an immune 
response. 

[0232] In another embodiment, the present invention provides a method for screening for 
an agent that modulates the expression of a MIC AL or MICAL-Like polypeptide. The 
method includes contacting a cell with a candidate agent. Then comparing the expression of 
the MICAL or MICAL-Like polypeptide, for example in the presence or absence of the 
candidate agent. A difference in the expression indicates that the agent modulates the 
expression of the MICAL polypeptide. In one aspect, the level of mRNA encoding 
MICAL is compared. In another aspect, the level of the MICAL polypeptide is compared. 

[0233] Specificity can be further analyzed, for example, by assaying for an activity or 
phenotype in cells that recombinantly overexpress MICAL and comparable cells that do not 
recombinantly overexpress MICAL, or cells in which MICAL has been knocked out or 
reduced versus comparable cells in which MICAL is expressed at normal levels. 

[0234] As illustrated in the Examples herein, MICALs are susceptible to small molecule 
inhibitors that affect their ability to oxidize their substrate. For example, gallic acid 
derivatives, including the green tea component (-)-epigallocatechin gallate (EGCG), appear 
to be potent and selective inhibitors of MICALs. It will be recognized that based on the 
identification of these small molecule inhibitors of semaphorin-mediated axonal repulsion, 
it will be recognized that other small molecule inhibitors can be identified. 

[0235] All available evidence points to the plexin cytoplasmic domain as an essential 
signal transducing domain for signaling class 3 semaphorin repulsion (Cheng et al., 2001; 
Takahashi and Strittmatter, 2001). Sema3A appears to utilize neuropilin-1 in combination 
with A class plexins to signal repulsive axon guidance. As illustrated in the Examples 
herein, agents that affect MICAL axonal guidance regulatory activity can be identified in 
vitro. For example, a rat DRG growth cone repulsion assay using Sema 3 A-secreting 293 
cells, as disclosed above, can be employed (Figure 4A; Messersmith et al., 1995). NGF- 
dependent DRG axons exhibit little to no outgrowth towards Sema3 A-secreting 293 cell 
aggregates (Figure 4C). However, this repulsion can be neutralized in a specific and dose- 
dependent manner by inclusion of an inhibitor such as EGCG or EC, in the growth media 
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(Figure 4C). As illustrated in the Example, like EGCG, EC is capable of completely 
neutralizing Sema-3 A-dependent repulsion in a dose-dependent manner, but a much higher 
EC concentration is required (Figures 4C). 

[0236] As will be understood, typically a method according to this embodiment of the 
invention includes a control sample wherein the polypeptide or cell is not contacted with the 
agent. However, known control values or qualitative results can also be used, such that a 
control sample does not need to be included each time the method is performed. For 
example, a visual microscopic comparison of axonal outgrowth can be performed using 
methods of the present invention that utilize the DRG rat growth cone repulsion assay. In 
addition, results of the rat DRG growth cone repulsion assay can be quantitated as 
illustrated in Fig. 4C. For example, results can be scored as the ratio of the axon lengths on 
the proximal and distal sides of the explant (P/D ratio), and on Sema3A-mediated growth 
cone collapse indicated as % collapsed growth cones. Therefore, known values for controls 
can be calculated and used to establish a baseline using known statistical methods, above 
which significant inhibition of the repulsion is established, thereby identifying an agent that 
affects axon guidance regulatory activity, most particular semaphorin-mediated axon 
repulsion. 

[0237] The agent can affect axonal guidance regulatory activity, mono oxygenase 
activity, and/or plexin interacting activity by enhancing or inhibiting this activity. The 
agent can be a small molecule, such as an antioxidant flavonoid, an antisense 
polynucleotide, a MICAL-like polypeptide or fragment thereof, a mutant MICAL 
polypeptide, a mutant MICAL-Like polypeptide, an anti-MICAL antibody, a double 
stranded RNA, or a peptidomimetic. 

[0238] As indicated herein, anti-oxidant flavonoids such as ECGC and EC are inhibitors 
of semaphorin-mediated axonal repulsion. Therefore, particularly important candidate 
agents include those with anti-oxidant activity, especially anti-oxidant flavonoids or other 
green tea polyphenols. A variety of antioxidant flavonoids have been identified and can be 
analyzed to determine whether they affect semaphorin-mediated axonal repulsion. For 
example, the antioxidant flavonoid can be a gallic acid derivative or another flavoprotein 
monooxygenase inhibitor. Examples of gallic acid derivatives that can be analyzed using 
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methods of the present invention include, but are not limited to, (-)-gallocatechin-3-0- 
gallate (GCG), (->epicatechin-3-<9-gallate (ECG), (-)-epigallocatechin (EGC), 
(+)-gallocatechin (GC), theasinensin A, 3"-0-methyl-EGCG, 3"-Omethyl-ECG, 3"-<9- 
methyl-GCG, (-)-epigallocatechin (EGC), (l)-gallocatechin (GC), gallic acid, catechin, 
n-octyl gallate, and w-cetyl gallate. 

[0239] Additionally, it will be recognized that new antioxidant flavonoids, for example 
new gallic acid derivatives can be synthesized and tested using the methods of this aspect of 
the present invention for the ability to affect axonal guidance regulatory activity. 
Furthermore, flavoprotein monooxygenase inhibitors, which include but are not limited to 
certain anti-oxidant flavonoids can be analyzed for an affect on axonal guidance regulatory 
activity using methods of this embodiment of the present invention. 

[0240] An agent tested by the screening embodiments of the present invention can 
include a known oxidase inhibitor. For example, an oxidase inhibitor outlined in Cross, 
Free Radical Biology and Med., 8:71-93 (1990) including, for example, DPI, ibuprofen, and 
aspirin. Furthermore, the screening assay can be used to test other agents including 
inhibitors of similar flavoprotein monooxygenase family members (See e.g., Cross (supra.); 
and Arnould and Camadro, Proc. Nat. Acad. Sci. 9 95:10553-10558 (1998)). 

[0241] In certain embodiments, a screening method of the invention can be performed, 
for example, by contacting under suitable conditions a MICAL, or a functional peptide 
portion thereof, a plexin such as plexin A, and an agent to be tested. The MICAL, the 
plexin and the agent can be contacted in any order as desired. As such, the screening 
method can be used to identify agents that can competitively or non-competitively inhibit 
MICAL binding to plexin, agents that can mediate or enhance MICAL binding to the 
plexin, agents that can induce dissociation of specifically bound MICAL from the plexin, 
and agents that otherwise affect the ability of MICAL to regulate axon guidance, such 
agents having agonist or antagonist activity. Appropriate control reactions are performed to 
confirm that the action of the agent is specific with respect to the MICAL. 

[0242] Suitable conditions for performing a screening method of the invention can be 
any conditions that allow MICAL to specifically interact with plexin, that provide a 
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semaphorin-plexin repulsive axon guidance activity, and/or that support MICAL 
monooxygenase activity, including methods as disclosed herein or otherwise known in the 
art. Thus, suitable conditions for performing the screening assay can be, for example, 
in vitro conditions using a substantially purified MICAL and/or plexin; cell culture 
conditions, utilizing a cell that normally expresses a semaphorin-plexin A mediated 
repulsive axon activity and a MICAL polypeptide, for example, a neuron, or a cell that has 
been genetically modified to express a semaphorin-plexin A mediated repulsive axon 
activity including expression of a MICAL polypeptide; or in situ conditions as occur in an 
organism. 

[0243] A screening method of the invention also can be performed using the methods of 
molecular modeling. The utilization of a molecular modeling method provides a 
convenient, cost effective means to identify those agents, among a large population such as 
a combinatorial library of potential agents, that are most likely to interact specifically with 
MICAL or a plexin, thereby reducing the number of potential agents that need to be 
screened using a biological assay. Upon identifying agents that interact specifically with a 
MICAL or a plexin, for example Plexin A, using a molecular modeling method, the selected 
agents can be examined for the ability to modulate an effect of a MICAL on a cell, such as 
regulation of axon guidance, using the methods disclosed herein. 

[0244] The ability of a test agent to modulate an effect of MICAL can be detected using 
methods as disclosed herein or otherwise known in the art. The term "test agent" or "test 
molecule" is used broadly herein to mean any agent that is being examined for agonist or 
antagonist activity in a method of the invention. Although the method generally is used as a 
screening assay to identify previously unknown molecules that can act as agonist or 
antagonist agents as described herein, the methods also can be used to confirm that a agent 
known to have a particular activity in fact has the activity, for example, in standardizing the 
activity of the agent. 

[0245] Further assays for testing the specificity of a candidate agent for MICAL, for 
example, test for lack of inhibition of steroid, 5 alpha-reductase, NADPH-cytochrome P450 
reductase, telomerase, MMP-2, MMP-9, and phenol sulfotransferase. These enzymes are 
known to be inhibited by EGCG in addition to its inhibition of flavoprotein 



WO 03/066821 



PCT/US03/03551 



75 

monooxygenases. Therefore, by using these assays, agents can be identified that are more 
specific inhibitors of MICALs than EGCG. 

[0246] A screening method of the invention provides the advantage that it can be 
adapted to high throughput analysis and, therefore, can be used to screen combinatorial 
libraries of test agents in order to identify those agents that can modulate an effect of 
MICAL on a cell, including those agents that can alter a specific interaction of MICAL and 
a plexin, and those that otherwise modulate MICAL axon guidance regulatory activity. 
Methods for preparing a combinatorial library of molecules that can be tested for a desired 
activity are well known in the art and include, for example, methods of making a phage display 
library of peptides, which can be constrained peptides (see, for example, U.S. Patent 
No. 5,622,699; U.S. Patent No. 5,206,347; Scott and Smith, Science 249:386-390, 1992; 
Markland et al., Gene 109:13-19, 1991; each of which is incorporated herein by reference); a 
peptide library (U.S. Patent No. 5,264,563, which is incorporated herein by reference); a 
peptidomimetic library (Blondelle et al, Trends Anal. Chem. 14:83-92, 1995; a nucleic acid 
library (O'Connell et al., supra, 1996; Tuerk and Gold, supra, 1990; Gold et al., supra, 
1995; each of which is incorporated herein by reference); an oligosaccharide library (York 
et al., Carb. Res., 285:99-128, 1996; Liang et al., Science, 274:1520-1522, 1996; Ding et al., 
Adv. Expt. Med. Biol, 376:261-269, 1995; each of which is incorporated herein by 
reference); a lipoprotein library (de Kruif et al., FEBS Lett, 399:232-236, 1996, which is 
incorporated herein by reference); a glycoprotein or glycolipid library (Karaoglu et al., 
J. Cell Biol, 130:567-577, 1995, which is incorporated herein by reference); or a chemical 
library containing, for example, drugs or other pharmaceutical agents (Gordon et al., 
J. Med. Chem., 37:1385-1401, 1994; Ecker and Crooke, Bio/Technology, 13:351-360, 1995; 
each of which is incorporated herein by reference). Polynucleotides can be particularly 
useful as agents that can modulate a specific interaction of MICAL and a plexin because 
nucleic acid molecules having binding specificity for cellular targets, including cellular 
polypeptides, exist naturally, and because synthetic molecules having such specificity can 
be readily prepared and identified (see, for example, U.S. Patent No. 5,750,342, which is 
incorporated herein by reference). 
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[0247] In view of the present disclosure, it will be recognized that various animal model 
systems can be used as research tools to identify agents useful for practicing a method of the 
invention. For example, as discussed above, transgenic flies, mice or other experimental 
animals can be prepared and the transgenic non-human organism can be examined directly 
to determine the effect produced by expressing various levels of a particular agent in the 
organism. 

[0248] As disclosed herein, MICALs can exert their activity, at least in part, through a 
Semaphorin-Plexin mediated pathway which can be associated with various pathological 
conditions. As such, the present invention provides new targets for the treatment of various 
conditions, especially regeneration of damaged neurological tissue, such as damage 
resulting from spinal cord injury, abnormal immune cell function, and abnormal cell 
motility such as cancer cell motility. Accordingly, the present invention provides methods 
for ameliorating the severity of a pathological condition in a subject, wherein the pathologic 
condition is characterized, for example, by an inability of neurons to regenerate proper 
axonal connections. 

[0249] In another embodiment, the present invention provides a method for affecting 
axonal guidance regulatory activity. The method includes contacting a cell, typically a 
neuron, that expresses a polypeptide of the invention such as a MICAL polypeptide, with an 
agent that affects axonal guidance regulatory activity or monooxygenase activity. 
Typically, the method is performed in vivo and includes inhibiting axonal guidance 
regulatory activity by contacting the cell with an antioxidant. The cell can be contacted 
chronically with the antioxidant. The axonal guidance activity is typically semaphorin- 
mediated axonal repulsion. As such, in another embodiment, the present invention provides 
a method for affecting a semaphorin-mediated process by contacting a cell that expresses a 
MICAL polypeptide of the invention with an effective amount of an agent that affects 
axonal guidance regulatory activity. The agent can be, for example, a small molecule, a 
polypeptide or fragment thereof, a peptidomimetic, or an antisense polynucleotide, as 
discussed herein. 



[0250] Not to be limited by theory, as disclosed herein MICALs regulate semaphorin- 
mediated axonal repulsion through a mechanism that requires their monooxygenase activity. 
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Therefore, as illustrated in the Examples section, antioxidants such as flavonoids inhibit 
semaphorin-mediated axonal repulsion, apparently by overcoming the effects of MICAL 
monooxygenase activity. 

[0251] As mentioned above, following spinal cord injury in humans, axons fail to 
reestablish their connections, which results in paralysis and loss of sensation of the affected 
area. During development, inhibition of axon growth plays a role in forming the nervous 
system. Axons are guided to their targets by molecules that attract them as well as by those 
that inhibit (i.e., repel) them. The inhibition of semaphorin-mediated axonal repulsion can 
allow axons to reach new targets. Therefore, methods for treating spinal cord injuries can 
focus on inhibiting axonal repulsion (See e.g., Schwab, M. E., Science, 295:1029 (2002); 
andFournier, A. E., and Strittmatter, S. M., Current Opinion inNeurobioL, 11:89 (2001)). 
Accordingly, methods of this embodiment of the invention are useful for example, for 
inducing regeneration of axons after spinal cord injury. 

[0252] In another embodiment, the present invention provides a method for treating a 
neurological condition in a subject, that includes contacting in the subject, a cell of the 
central nervous system and/or peripheral nervous system having a disrupted axonal 
connection, or a cell that affects axonal growth of the central nervous system and/or 
peripheral nervous system cell, or surrounding tissue, with an amount of an agent that 
modulates the activity or expression of a MICAL polypeptide, the amount being effect to 
modulate axonal guidance regulatory activity, axon out-growth, monooxygenase activity, or 
plexin-interacting activity. The subject in certain aspects is a human patient in need of the 
treatment. The neurological condition is any neurological condition in which a treatment 
strategy includes promoting axon growth. For example, neurological conditions treatable 
using methods of the present invention include a spinal cord injury, traumatic brain injury, 
neuropathic pain, Parkinson's Disease, Amyotrophic Lateral Sclerosis (ALS), ischemic 
injury, Alzheimer's Disease, Multiple Sclerosis, Huntington' s chorea, multiple system 
atrophy, progressive supranuclear palsy, traumatic brain injury, neuropathic pain, ischemic 
injury, a neuropathy resulting from a stroke, a peripheral neuropathy resulting from 
chemotherapy, or a peripheral neuropathy resulting from diabetes. The neurodegenerative 
disease may be associated with a bacterial, viral or other infection, such as damage caused 
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by HIV or herpes viral infections, encephalitis, and Creutzfeldt- Jacob disease and kuru or 
may be due to the effects of a drug or toxin. Surrounding tissue is any substrate through 
which an axon need to re-grow or re-establish its connections. 

[0253] In certain aspects of this embodiment of the invention, the agent contacts a site in 
need of axonal growth or regrowth chronically. For example, the agent is applied for a 
length of time sufficient to promote neurorestoration. In general, the length of time of 
chronic application of an agent for the present invention is longer than the time period 
required to protect injured neurons (i.e. neuroprotection) from the harmful cascade of 
secondary events that follow injury, for example detrimental inflammatory responses and 
death of neurons and glia (i.e. secondary death due to reactive oxidative species (lipid 
peroxidation)). The length of time for chronic agent contact in the present invention is the 
time necessary to continue to stimulate axonal growth, to guide axons to their targets, and/or 
to establish new functional synapses. In certain aspects of the present invention, an agent 
can be applied initially to save neurons (neuroprotection), and continued over longer periods 
to promote neurorestoration. Therefore, the present invention in certain aspects, couples 
neuroprotection and neurorestoration through delivery of agents a short time after 
neurological damage, followed by long-term administration of the agent. 

[0254] In one aspect of the present invention, the agent is applied for at least 1, 2, 7, or 
14 days, or 1, 2, 3, 4, 5, 6, 12, 24, 36, 48, 60 months, either continually or repeatedly, for 
example by use of gene therapy, after identification or suspicion of a neurological condition. 
Other embodiments of the invention that include beneficial aspects directed at chronic 
application of an agent include for example, a method for affecting axon growth, a method 
for affecting a plexin mediated process, a method for treating a neurological disorder 
involving failure of axon regrowth, and a method for inducing regrowth and preventing 
inhibition of an injured process of a neuron. 

[0255] As indicated herein, monooxygenase inhibitors and anti-oxidant flavonoids, 
including those that are monooxygenase inhibitors such as ECGC and EC, are inhibitors of 
semaphorin-mediated axonal repulsion. Accordingly, monooxygenase inhibitors and anti- 
oxidant flavonoids can be used as the agent in methods of various embodiments of the 
present invention, to inhibit axonal guidance regulatory activity. These embodiments of the 
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invention include methods for treating a neurological condition, methods for affecting 
axonal guidance regulatory activity, methods for affecting axon growth, methods for 
affecting a plexin-mediated process, methods for treating a neurological disorder involving 
a failure of axon regrowth, and methods for inducing regrowth of an injured process of a 
neuron. 

[0256] As mentioned above, a variety of monooxygenase inhibitors and anti-oxidant 
flavonoids have been identified and can be used to inhibit axonal guidance regulatory 
activity, such as semaphorin-mediated axonal repulsion. The anti-oxidant flavonoid can be 
a gallic acid derivative such as ECGC or EC, another flavoprotein monooxygenase 
inhibitor, another green tea polyphenol. Examples of gallic acid derivatives that can be 
used to affect axonal guidance regulatory activity in embodiments of the present invention 
include, but are not limited to, ECGC, EC, (-)-gallocatechin-3-(3-gallate (GCG), (-)- 
epicatechin-3-O-gallate (ECG), (-)-epigallocatechin (EGC), (+)-gallocatechin (GC), 
theasinensin A, 3"-<9-methyl-EGCG, 3"-<9-methyl-ECG, 3"-<9-methyl-GCG, (-)- 
epigallocatechin (EGC), (l)-gallocatechin (GC), gallic acid, catechin, rc-octyl gallate, and n- 
cetyl gallate. 

[0257] An agent used in the methods of the present invention for affecting axonal 
guidance regulatory activity or monooxygenase activity, in certain aspects, is a known 
oxidase inhibitor. For example, an oxidase inhibitor reported in Cross, Free Radical 
Biology and Med., 8:71-93 (1990) including DPI, ibuprofen, and/or aspirin. Furthermore, 
the screening assay can be used to test other agents including inhibitors of similar 
flavoprotein monooxygenase family members (See e.g., Cross (supra.); and Arnould and 
Camadro, Proc. Nat. Acad. Set, 95:10553-10558 (1998)). 

[0258] Various embodiments of the present invention can further include contacting the 
cell with an agent that modulates MICAL activity and affects axon regeneration. The 
embodiments include methods for treating a neurological condition, methods for treating a 
neurological disorder involving a failure of axon regrowth, and methods for inducing 
regrowth of an injured process of a neuron, as described herein. In certain aspects, the agent 
that affects axon regeneration promotes axon regeneration. Accordingly, the agent can be a 
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neurotrophic factor, a mechanical bridge, and/or a stem cell (see e.g., Blesch A., et al., 
Brain Res. Bulletin, 57:83, 2002; and Cao, Q., et al., J. NeuroscL Res., 68: 501 (2002)). 

[0259] Mechanical bridges include, for example, genetically engineered cells, stem cells, 
fetal tissue, Schwann cells, olfactory ensheathing glia, as discussed below. Neurotrophins 
are molecules with closely related structures that are known to support the survival of 
different classes of embryonic neurons. Virtually any neurotrophin can be used in methods 
of the present invention, including, for example, nerve growth factor (NGF), brain-derived 
neurotrophic factor (BDNF), neurotrophin 3 (NT 3), neutortrophin-4/5 (NT-4/5), glia cell 
line-derived neurotrophic factor (GDNF), and leukemia inhibitory factor (LIF). 

[0260] Neurotrophins can be delivered to a site of severed axons, for example, by use of 
gene therapy. For example, viral vectors, including retroviral vectors, as described herein, 
capable of infecting neurons or glia can provide a localized source of trophic factors, such 
as neurotrophins to stimulate axonal outgrowth. Genetically modified cells grafted to a 
lesion site in the spinal cord can provide not only augmented amounts of trophic molecules 
at the injury site, but also a potential axonal growth substrate. Thus, genetically modified 
cells can provide mechanical bridges for growing axons to potentially connect injured spinal 
cord regions(Blesch et al., 2002). Furthermore, genetically modified cells can be used for 
long-term delivery of a neurotrophin, in a regulated manner. Furthermore, gene therapy can 
be used to deliver MICAL inhibitors, such as mutant MICALs or MICAL-Like proteins, 
chronically to a site of spinal cord injury. 

[0261] Cell types used for grafting in gene therapy, can include for example fibroblasts, 
Schwann cells, and neural stem cells (see e.g., Brecknell J. E., et al., Neuroscience, 
74(3):775-84 (1996)). Furthermore, autologous cells can be used to avoid immune 
responses and graft rejection. Gene therapy for use in the present invention can include in 
vivo or ex vivo gene therapy. 

[0262] Methods of the present invention that include providing a stem cell as well as an 
anti-oxidant typically involve providing both the stem cell and the anti-oxidant to a site of 
neuronal damage, such as a site of spinal cord injury. For example, the stem cell can be 
grafted to a site of spinal cord injury. 
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[0263] A variety of stem cells can be used with the methods of the present invention. 
The stem cells are typically neural stems cells (i.e., stem cells that give rise to cells of 
neuronal lineage). The stem cell can be an embryonic stem cell or an adult stem cell and 
can be isolated, for example from blood or bone marrow (See e.g., Kabos, P., et al., Exp 
Neurol, 178(2):288-93 (2002)). Neural stem cells typically express the marker, nestin. 
The stem cells are capable of giving rise to neurons and glial cells. 

[0264] Neural stem cells could be induced towards neuronal phenotypes to allow the 
replacement of spinal neurons lost after injury, toward astrocytes to restore the non-neuronal 
milieu of the pre-injured spinal cord, or towards oligodendroglia to allow remyelination. 
Either neuronal or glial lineages could be useful for reconstituting a permissive substrate for 
regenerating axons to extend over sites of injury (Blesch et al., 2002). The cells can be 
modified to produce a recombinant neurotrophic factor. 

[0265] Other mechanical bridges of the invention in certain aspects, include an axon 
outgrowth promoting molecule such as netrin, laminin, collagen, or artificial polymer-based 
substrates. In another embodiment, the present invention provides a method for affecting 
axon growth, that includes contacting a neuronal lineage cell with an agent that inhibits 
axonal guidance regulatory activity of a polypeptide as set forth in SEQ ID NO:2 (human 
MICAL-1), SEQ ED NO:4 (human MICAL-2), SEQ ID NO:6 (human MICAL-3), or SEQ 
ID NO: 8 {Drosophila MICAL long isoform), SEQ ID NO: 10 {Drosophila MICAL medium 
isoform), SEQ ID NO: 12 {Drosophila MICAL short isoform). 

[0266] The neuronal lineage cell can be for example, a neuron or an oligodendrocyte {J. 
Neurosci. Jul 15;22(14):5992-6004 (2002)). Furthermore, the neuronal lineage cell can be a 
neural stem cell, or a neuronal cell derived from an isolated stem cell. In this aspect, the 
stem cell can be introduced into a subject at the site of a spinal cord injury, and contacted 
with the agent before introduction into the subject, and/or at the site of the spinal cord 
injury. Accordingly, the cell can be contacted with the agent in vivo or in vitro. 

[0267] In another embodiment, the present invention provides a method for affecting a 
plexin-mediated process or a semaphorin-mediated process, that includes contacting a cell 
that carries out the plexin-mediate process or semaphorin-mediated process, such as a cell 
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expressing a MICAL-polypeptide of the present invention, with an effective amount of an 
agent, for example an antioxidant such as a monooxygenase inhibitor, that affects a MICAL 
polypeptide activity. The agent affects the plexin-mediated process or the semaphorin- 
mediated process. For example, where the plexin-mediated process is axonal regrowth, the 
agent is chronically contacted with the cell. 

[0268] For this embodiment of the invention, virtually any cell can be used. For 
example a recombinant cell that expresses a MICAL polypeptide can be used. The 
recombinant cell can be obtained using a vector of the present invention and transfection or 
transformation methods well known in the art. The cell in this embodiment of the 
invention can be a cell of the nervous system, such as a neuron, an immune cell, a cancer 
cell, and a cardiac cell, particularly a cardiac neural crest. 

[0269] As is indicated herein by the analysis of MICAL proteins in a model invertebrate 
system (Drosophila) and in the vertebrate nervous system, the interactions between all 
semaphorins, both secreted and transmembrane, with plexin family members are likely to 
involve essential interactions and functions provided by MICAL proteins. The present 
disclosure suggests that the N-terminal MICAL monooxygenase domain is essential for 
semaphorin/plexin-mediated neuronal repulsion. Therefore, MICALs and the subclass of 
monooxygenase they belong to are targets with regard to neuronal regeneration following 
injury and various strategies designed to promote neuronal regeneration following 
neurodegeneration. Additionally, since plexins bind homophilically to other plexins 
{Neuron, 14: 1 189-99 (1995)) they may signal in a semaphorin-independent, but MICAL- 
dependent, manner. 

[0270] The cell included in methods of the present invention, including methods for 
affecting a plexin-mediated or semaphorin-mediated process, in certain aspects is a cell of 
the immune system. For example, the cell of the immune system can be a lymphocyte, such 
as a B-cell, a T-cell, or precursor thereof, a monocyte, or a phagocyte. Other cell types for 
methods of the present invention include cancer cells, particularly metastatic cancer cells, 
and cardiac cells, particularly cardiac cells from the neural crest. In certain aspects the 
plexin-mediated process is mediated by Plexin A or Plexin B. In certain aspects the 
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semaphorin-mediated process is mediated by Sema la, Sema 3a, Sema 7a, or a class 4 
semaphorin. 

[0271] Semaphorins and plexins have been shown to provide important functions in the 
immune system (See e.g., Ventura, A., and Pelicci, P. G., "Semaphorins: Green Light for 
Redox Signaling?" Science 's STKE, pgs. 1-3 (2002)). In addition to what appear to be a 
plexin-independent function of the class 4 semaphorin Sema4D (i.e. CD 100) in the 
enhancement of B-cell responses via the inactivation of CD72, viral semaphorin interactions 
with plexin CI (aka VESPR) induce robust responses in human monocytes (i.e. cell 
aggregation and the expression of pro-inflammatory cytokines). The mammalian 
orthologue of one of these viral semaphorins, called Sema7A, may therefore also be 
involved in immunomodulation events. Further, the interactions between class 4 
semaphorins such as CD 100 with B class plexins, interactions which are well-characterized 
in the vertebrate neurons, may function in the immune system. The expression of all class 
B plexins in the immune system has not been exhaustively determined, and it is likely that 
neuronal class 4 semaphorin/Plexin B functions have immune system counterparts. 

[0272] Finally, the initial characterization of MICAL as a CasL interacting protein was 
carried out using human thymus cells as a starting source of tissue. Cas family members are 
important for TCR and bl integrin-induced immunological reactions in lymphocytes, 
including interleukin-2 production and various migratory responses. Thus, though at 
present it is difficult to predict exact roles in immune system function, it is likely that 
MICALs will be involved in plexin immune system function, that modulation of Cas protein 
function will follow, and that the disclosure herein with respect to characterization of 
flavonoids which are likely MICAL antagonists will at the least provide a set of compounds 
with potent effects on immune system function. 

[0273] The plexin-mediated process or the semaphorin-mediated process can be 
semaphorin la-PlexA-mediated repulsive axon guidance. In another aspect, however, the 
plexin-mediated process or semaphorin-mediated process affected by antioxidants in the 
methods of the present invention can include cell migration, for example the migration of 
cancer cells (see e.g., Trusolino, L., and Comoglio, P.M., Nat., Rev. Cancer, 2:289 (2002)). 
Studies by J. Mina and colleagues, for example, have implicated class 3 semaphorins in 
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certain small cell lung carcinomas. Sema3F, and more recently the closely linked gene for 
Sema3B, have both been associated with genetic lesions in lung cancer cell lines, and the 
most recent evidence suggests that SemaSB may be a key determinant in these lung cancers. 
Given demonstrations of class 3 semaphorin mediated effects on neural crest cell 
morphology and GABAergic cortical neuronal migration, coupled with the in vitro 
semaphorin collapse assays recently developed in tissue culture cell lines (COS cells, Cell, 
99:59-69 (1999)), if seems reasonable to think that non-neuronal cell adhesivity and 
migration are influenced by these secreted repellents. Based upon the present disclosure, 
which implicate MICAL monooxygenase function in Sema3 A repulsion in vertebrates, it is 
expected that class 3 semaphorin function can be either attenuated or enhanced by using 
anti-oxidants that affect MICAL function. 

[0274] Class 4 semaphorins, which as described are likely to play key roles in the 
immune system, have also recently been implicated in regulating invasive growth by 
Comoglio and colleagues. Sema3F in a liver cell line apparently can mediate invasive 
growth via a coupling of PlexinBl and Met receptor signaling. This requirement for a 
plexin and an associated membrane bound co-receptor component for a functional 
semaphorin response is reminiscent in certain respects of observations in Drosophila from 
Goodman and colleagues showing that Plexin A requires offtrack (OTK), a protein related 
to receptor tyrosine kinases, for plexin-mediated semaphorin repulsion. This work suggests 
that Plexin Bl, and perhaps many other plexins, can regulate cell migration, either 
positively or negatively. It is interesting to note that recent work by T. Hunter and 
colleagues demonstrates a requirement for both FAK and CasL function for Ephrin/Eph 
receptor mediated attractive responses. 

[0275] A role has also been established for plexinA2 and semaphorin signaling in 
cardiac neural crest cells. For example, studies have shown that PlexinA2 is expressed in 
migrating and postmigrating cardiac neural crest cells in the mouse (Brown et aL, 
Development 128:3071 (2001)). Furthermore, it has been shown that PlexinA2-expressing 
cardiac neural crest cells are patterned abnormally in several mutant mouse lines with 
congenital heart disease including those lacking Semaphorin 3C. (Id). 
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[0276] Additionally, reports have suggested a role of plexins in numerous other diseases. 
For example, studies have revealed that at least certain semaphores are dysregulated and/or 
downregulated in patients with Alzheimer's Disease and Down's Syndrome. (Andorn, A. 
C, and Kalaria, R.N., Acta. Neurochir. Suppl. (Wien) 70:212-5 (1997); Lubec, G., J Neural 
Transm Suppl, 57:161-77 (1999); and Hirsch et al., Brain Res., 27:67-79 (1999)). Other 
studies have identified a plexin family member as involved in polycystic kidney and hepatic 
disease (Onuchic, L.F., et al., Pediatr. Res. 52:830 (2002)). Furthermore, single nucleotide 
polymorphisms (SNPs) related to Rett syndrome have been identified in a Plexin gene 
(Dahle, A. R., et al., Am. J. Med. Genet. 3:69 (2000)). 

[0277] Finally, the very well documented roles of Cas family members in regulating 
non-neuronal cell morphology and various growth characteristics also suggest 
understanding that MICALs are linked to CasL could be invaluable for controlling certain 
cellular behaviors. pl30Cas was first identified and a target of hyperphosphorylation by 
oncogenic Src and Crk. Since then, many in vitro experiments have implicated Cas proteins 
in the maintenance of the transformed cell state, however to date in vivo support for Cas 
function in cell transformation is still lacking. Nevertheless, the demonstrated roles for Cas 
proteins in cell cycle progression, the regulation of cell shape, and for the induction of cell 
migration, make this protein an attractive target for controlling a myriad of cellular 
functions, from osteoclast activation to vasculogenesis and angiogenesis. How MICAL 
regulates Cas is at present unknown and is a major focus for our own research. However, 
given well-established interactions between Cas family members and a variety of proteins 
essential for the establishment and maintenance of cell shape, cell-cell interactions, and 
morphological changes including migration (including, but not limited to, FAK, RAFTK, 
Crk, Fyn, Yes, Abl, Grb2, several phosphatases including PTP1B, Nek, and Src), it is likely 
that MICALs play a role in these processes through Cas interactions. 

[0278] A method of the invention for the various embodiments that include contacting a 
cell or a MICAL with an agent such as an antioxidant, can be performed, for example, by 
contacting under suitable conditions a target cell and an agent that affects a MICAL 
function such as MICAL axon guidance regulatory activity. Suitable conditions can be 
provided by placing the cell, which can be an isolated cell or can be a component of a tissue 
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or organ, in an appropriate culture medium, or by contacting the cell in situ in an organism. 
For example, a medium containing the cell can be contacted with an agent the affects the 
ability of a MICAL to specifically interact with a plexin expressed on the cell, or with an 
agent that affects MIXAL axon guidance regulatory activity in the cell. In general, the cell 
is a component of a tissue or organ in a subject, in which case contacting the cell can 
comprise administering the agent to the subject. However, the cell also can be manipulated 
in culture, then can be maintained in culture, administered to a subject, or used to produce a 
transgenic nonhuman animal. 

[0279] An agent useful in a method of the invention can be any type of molecule, for 
example, a polynucleotide, a peptide, a peptidomimetic, peptoids such as vinylogous 
peptoids, a small organic molecule, or the like, and can act in any of various ways to affect a 
MICAL function such as axon guidance regulatory activity. The agent can act to alter a 
semaphorin-mediated pathway in the cell. In addition, the agent can be an agonist, which 
mimics or enhances the effect of MICAL on a cell, for example, the ability of MICAL to 
specifically interact with a plexin, thereby increasing MICAL axon guidance regulatory 
activity in the cell; or can be an antagonist, which reduces or inhibits the effect of MICAL 
on a cell, thereby reducing or inhibiting MICAL axon guidance regulatory activity. For 
example, administration of an antagonist can result in axon growth and new positioning to 
allow formation of new targets by inhibiting semaphorin-mediated axonal repulsion activity. 

[0280] As used herein, the term "specific interaction" or "specifically binds" or the like 
means that two molecules form a complex that is relatively stable under physiologic 
conditions. The term is used herein in reference to various interactions, including, for 
example, the interaction of MICAL and a plexin such as plexin A, the interaction of the 
intracellular components of a semaphorin-mediated pathway, and the interaction of an 
antibody and its antigen. A specific interaction can be characterized by a dissociation 
constant of at least about 1 x 10' 6 M, generally at least about 1 x 10" 7 M, usually at least 
about 1 x 10" 8 M, and particularly at least about 1 x 10" 9 Mor 1 x 10" 10 M or greater. A 
specific interaction generally is stable under physiological conditions, including, for 
example, conditions that occur in a living individual such as a human or other vertebrate or 
invertebrate, as well as conditions that occur in a cell culture such as used for maintaining 
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mammalian cells or cells from another vertebrate organism or an invertebrate organism. 
Methods for determining whether two molecules interact specifically are well known and 
include, for example, equilibrium dialysis, surface plasmon resonance, and the like. 

[0281] An agent that alters a specific interaction of a MICAL with a plexin can act, for 
example, by binding to a MICAL such that it cannot interact specifically with the plexin, by 
competing with MICAL for binding to the plexin, or by otherwise by-passing the 
requirement that MICAL specifically interact with a plexin in order to regulate axonal 
guidance. A mutant plexin that retains its ability to bind to a MICAL but not other plexin 
functions is an example of an agent that can bind a MICAL, thereby sequestering it and 
reducing or inhibiting its ability to interact specifically with a functional plexin. A MICAL 
mutant that includes only its plexin interacting region is an example of an agent that can 
compete with MICAL for plexin binding, thereby reducing or inhibiting the ability of the 
MICAL to interact specifically with a plexin. Such MICAL antagonists are useful in 
practicing a method of the invention, particularly for reducing or inhibiting MICAL axon 
guidance regulatory activity. 

[02821 An agent useful in a method of the invention an antibody that specifically binds a 
MICAL, including all or a portion of the plexin interacting region, thereby preventing 
MICAL from interacting specifically with a plexin. Alternatively, the agent can be an 
antibody that specifically binds to a plexin, including all or a portion of the MICAL 
interacting region, thereby preventing the plexin from interacting specifically with a 
MICAL. Such an anti-MICAL or anti-plexin antibody can be selected for its ability to 
specifically bind MICAL or plexin, respectively, without activating MICAL axon guidance 
regulatory activity, and can be useful as a MICAL antagonist for reducing or inhibiting 
MICAL axon guidance regulatory activity; or can be selected for its ability to specifically 
bind MICAL or plexin, respectively and activate axon guidance regulatory activity, thus 
acting as a MICAL agonist. The antibody can be raised using a MICAL or a plexin, 
including plexin A as an immunogen, or can be an anti-idiotype antibody, which is raised 
against an anti-MICAL antibody and mimics MICAL. 
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[0283] An agent useful in a method of the invention also can be an agent that reduces 
MICAL monooxygenase activity, thereby reducing or inhibiting MICAL axon guidance 
regulatory activity. 

[0284] In addition, an agent useful in a method of the invention can be a mutant plexin, 
which, for example, lacks semaphorin signal transduction activity in response to MICAL 
binding, or has constitutive semaphorin signal transduction activity. For example, a mutant 
plexin can have a point mutation, a deletion, or the like in a functional domain other than 
the MICAL binding domain. Such a dominant negative mutant plexin lacks the ability to 
transmit a semaphorin and/or MICAL signal despite the fact that it can specifically bind a 
MICAL. 

[0285] An agent useful in a method of the invention also can modulate the level or 
activity of a MICAL. 

[0286] The specific interaction of MICAL with plexin A indicates that MICAL axonal 
guidance regulatory activity can involve components of the semaphorin-plexin mediated 
repulsive axon guidance pathway. Thus, the Semaphorin repulsive axon guidance pathway 
provides a target for modulating the effect of MICAL on a cell, and agents that affect the 
Semaphorin pathway can be useful for modulating MICAL axon guidance regulatory 
activity. 

[0287] Antagonist agents that can reduce or inhibit MICAL axon guidance regulatory 
activity are exemplified by dominant negative MICAL polypeptides in which the a 
functional domain other than the plexin-interacting region has been mutated. The mutants 
include polypeptides that include a plexin-interacting region and no other 

[0288] Where the agent that acts intracellularly is a peptide or a polypeptide, it can be 
contacted with the cell directly, or a polynucleotide encoding the peptide (or polypeptide) 
can be introduced into the cell and the peptide can be expressed in the cell. It is recognized 
that some of the peptides useful in a method of the invention are relatively large and, therefore, 
may not readily traverse a cell membrane. However, various methods are known for 
introducing a peptide into a cell. The selection of a method for introducing such a peptide into 
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a cell will depend, in part, on the characteristics of the target cell, into which the polypeptide is 
to be provided. For example, where the target cells, or a few cell types including the target 
cells, express a receptor, which, upon binding a particular ligand, is internalized into the cell, 
the peptide agent can be operatively associated with the ligand. Upon binding to the receptor, 
the peptide is translocated into the cell by receptor-mediated endocytosis. The peptide agent 
also can be encapsulated in a liposome or formulated in a lipid complex, which can facilitate 
entry of the peptide into the cell, and can be further modified to express a receptor (or ligand), 
as above. The peptide agent also can be introduced into a cell by engineering the peptide to 
contain a protein transduction domain such as the human immunodeficiency virus TAT protein 
transduction domain, which facilitates translocation of the peptide into the cell (see Schwarze 
et al., Science 285:1569-1572 (1999), which is incorporated herein by reference; see, also, 
Derossi et al, J, Biol Chem. 271:18188 (1996)). The target cell also can be contacted with a 
polynucleotide encoding the peptide or polypeptide agent, which can be expressed in the 
cell. 

[0289] An agent useful in a method of the invention can be a polynucleotide, which can 
be contacted with or introduced into a cell as described above. Generally, but not 
necessarily, the polynucleotide is introduced into the cell, where it effects its function either 
directly, or following transcription or translation or both. For example, as discussed above, 
the polynucleotide can encode a polypeptide agent, which is expressed in the cell and 
modulates MICAL activity. Such an expressed polypeptide can be, for example, a mutant 
MICAL polypeptide, which does not have monooxygenase activity; or can be a mutant 
plexin. Methods for introducing a polynucleotide into a cell are exemplified below or 
otherwise known in the art. 

[0290] A polynucleotide agent useful in a method of the invention also can be, or can 
encode, an antisense molecule, a ribozyme or a triplexing agent. For example, the 
polynucleotide can be (or can encode) an antisense nucleotide sequence such as an antisense 
MICAL, plexin, or semaphorin sequence, which can act as an antagonist to reduce or inhibit 
MICAL axon guidance regulatory activity, thereby inhibiting semaphorin-mediated 
repulsive axon guidance. Such polynucleotides can be contacted directly with a target cell 
and, upon uptake by the cell, can effect their antisense, ribozyme or triplexing activity; or 
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can be encoded by a polynucleotide that is introduced into a cell, whereupon the 
polynucleotide is expressed to produce, for example, an antisense RNA molecule or 
ribozyme, which effects its activity. 

[0291] An antisense polynucleotide, ribozyme or triplexing agent is complementary to a 
target sequence, which can be a DNA or RNA sequence, for example, messenger RNA, and 
can be a coding sequence, a nucleotide sequence comprising an intron-exon junction, a 
regulatory sequence such as a Shine-Delgarno sequence, or the like. The degree of 
complementarity is such that the polynucleotide, for example, an antisense polynucleotide, 
can interact specifically with the target sequence in a cell. Depending on the total length of 
the antisense or other polynucleotide, one or a few mismatches with respect to the target 
sequence can be tolerated without losing the specificity of the polynucleotide for its target 
sequence. Thus, few if any mismatches would be tolerated in an antisense molecule 
consisting, for example, of 20 nucleotides, whereas several mismatches will not affect the 
hybridization efficiency of an antisense molecule that is complementary, for example, to the 
full length of a target mRNA encoding a cellular polypeptide. The number of mismatches 
that can be tolerated can be estimated, for example, using well known formulas for 
determining hybridization kinetics (see Sambrook et al., supra, 1989) or can be determined 
empirically using methods as disclosed herein or otherwise known in the art, particularly by 
determining that the presence of the antisense polynucleotide, ribozyme, or triplexing agent 
in a cell decreases the level of the target sequence or the expression of a polypeptide 
encoded by the target sequence in the cell. 

[0292] A polynucleotide useful as an antisense molecule, a ribozyme or a triplexing 
agent can inhibit translation or cleave the nucleic acid molecule, thereby modulating 
MYCAL axon guidance regulatory activity in a cell. An antisense molecule, for example, 
can bind to an mRNA to form a double stranded molecule that cannot be translated in a cell. 
Antisense oligonucleotides of at least about 15 to 25 nucleotides are preferred since they are 
easily synthesized and can hybridize specifically with a target sequence, although longer 
antisense molecules can be expressed from a polynucleotide introduced into the target cell. 
Specific nucleotide sequences useful as antisense molecules can be identified using well 
known methods, for example, gene walking methods (see, for example, Seimiya et al., 
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J. Biol Chem. 272:4631-4636 (1997), which is incorporated herein by reference). Where 
the antisense molecule is contacted directly with a target cell, it can be operatively 
associated with a chemically reactive group such as iron-linked EDTA, which cleaves a 
target RNA at the site of hybridization. A triplexing agent, in comparison, can stall 
transcription (Maher et aL, Antisense Res. Devel 1 :227 (1991); Helene, Anticancer Drug 
Design 6:569 (1991)). Thus, a triplexing agent can be designed to recognize, for example, a 
sequence of a MIC AL gene regulatory element, thereby reducing or inhibiting the 
expression of a MIC AL polypeptide in the cell, and modulating MICAL axon guidance 
regulatory activity in a target cell. 

[0293] The agent to be administered to the subject is administered under conditions that 
facilitate contact of the agent with the target cell and, if appropriate, entry into the cell. 
Entry of a polynucleotide agent into a cell, for example, can be facilitated by incorporating 
the polynucleotide into a viral vector that can infect the cells. If a viral vector specific for 
the cell type is not available, the vector can be modified to express a receptor (or ligand) 
specific for a ligand (or receptor) expressed on the target cell, or can be encapsulated within 
a liposome, which also can be modified to include such a ligand (or receptor). A 
polypeptide agent can be introduced into a cell by various methods, including, for example, 
by engineering the peptide to contain a protein transduction domain such as the human 
immunodeficiency virus TAT protein transduction domain, which can facilitate 
translocation of the peptide into the cell (see Schwarze et al., supra, 1999; Derossi et aL, 
supra, 1996). 

[0294] The presence of the agent in the target cell can be identified directly, for example, 
by operatively linking a detectable label to the agent, by using an antibody specific for the 
agent, particularly a polypeptide agent, or by detecting a downstream effect due to the 
agent, for example, decreased semaphorin-mediated axon repulsion in the cell. An agent 
can be labeled so as to be detectable using methods well known in the art (Hermanson, 
"Bioconjugate Techniques" (Academic Press 1996), which is incorporated herein by 
reference; see, also, Harlow and Lane, supra, 1988). For example, a peptide or 
polynucleotide agent can be labeled with various detectable moieties including a radiolabel, 
an enzyme such as alkaline phosphatase, biotin, a fluorochrome, and the like. Where the 
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agent is contained in a kit, the reagents for labeling the agent also can be included in the kit, 
or the reagents can be purchased separately from a commercial source. 

[0295] An agent useful in a method of the invention can be administered to the site of 
the pathologic condition, or can be administered by any method that provides the target cells 
with the polynucleotide or peptide. As used herein, the term "target cells" typical means an 
immune cell, a transformed eukaryotic cell, or a cell of the nervous system, for example a 
neuron, that are to be contacted with the agent. For administration to a living subject, the 
agent generally is formulated in a pharmaceutical composition suitable for administration to 
the subject. Thus, the invention provides pharmaceutical compositions containing an agent, 
which is useful for modulating MICAL axonal guidance regulatory activity in a cell, in a 
pharmaceutically acceptable carrier. As such, the agents are useful as medicaments for 
treating a subject suffering from a pathological condition as defined herein. 

[0296] Pharmaceutically acceptable carriers are well known in the art and include, for 
example, aqueous solutions such as water or physiologically buffered saline or other 
solvents or vehicles such as glycols, glycerol, oils such as olive oil or injectable organic 
esters. A pharmaceutically acceptable carrier can contain physiologically acceptable 
compounds that act, for example, to stabilize or to increase the absorption of the conjugate. 
Such physiologically acceptable compounds include, for example, carbohydrates, such as 
glucose, sucrose or dextrans, antioxidants, such as ascorbic acid or glutathione, chelating 
agents, low molecular weight proteins or other stabilizers or excipients. One skilled in the 
art would know that the choice of a pharmaceutically acceptable carrier, including a 
physiologically acceptable compound, depends, for example, on the physico-chemical 
characteristics of the therapeutic agent and on the route of administration of the 
composition, which can be, for example, orally or parenterally such as intravenously, and by 
injection, intubation, or other such method known in the art. The pharmaceutical 
composition also can contain a second reagent such as a diagnostic reagent, nutritional 
substance, toxin, or therapeutic agent, for example, a cancer chemotherapeutic agent. 

[0297] The agent can be incorporated within an encapsulating material such as into an 
oil-in- water emulsion, a microemulsion, micelle, mixed micelle, liposome, microsphere or 
other polymer matrix (see, for example, Gregoriadis, Liposome Technology, Vol. 1 (CRC 
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Press, Boca Raton, FL 1984); Fraley, et al., Trends Biochem. Sci., 6:77 (1981), each of 
which is incorporated herein by reference). Liposomes, for example, which consist of 
phospholipids or other lipids, are nontoxic, physiologically acceptable and metabolizable 
carriers that are relatively simple to make and administer. "Stealth" liposomes (see, for 
example, U.S. Patent Nos. 5,882,679; 5,395,619; and 5,225,212, each of which is 
incorporated herein by reference) are an example of such encapsulating materials 
particularly useful for preparing a pharmaceutical composition useful for practicing a 
method of the invention, and other "masked" liposomes similarly can be used, such 
liposomes extending the time that the therapeutic agent remain in the circulation. Cationic 
liposomes, for example, also can be modified with specific receptors or ligands (Morishita 
et al., J. Clin. Invest., 91 :2580-2585 (1993), which is incorporated herein by reference). In 
addition, a polynucleotide agent can be introduced into a cell using, for example, 
adenovirus-polylysine DNA complexes (see, for example, Michael et al., J. Biol. Chem. 
268:6866-6869 (1993), which is incorporated herein by reference). 

[0298] The route of administration of a pharmaceutical composition containing an agent 
that alters MICAL axon guidance regulatory activity such as semaphorin-mediated axon 
repulsion activity, will depend, in part, on the chemical structure of the molecule. 
Polypeptides and polynucleotides, for example, are not particularly useful when 
administered orally because they can be degraded in the digestive tract. However, methods 
for chemically modifying polypeptides, for example, to render them less susceptible to 
degradation by endogenous proteases or more absorbable through the alimentary tract are 
well known (see, for example, Blondelle et al., supra, 1995; Ecker and Crook, supra,'\995). 
In addition, a peptide agent can be prepared using D-amino acids, or can contain one or 
more domains based on peptidomimetics, which are organic molecules that mimic the 
structure of peptide domain; or based on a peptoid such as a vinylogous peptoid. 

[0299] A pharmaceutical composition as disclosed herein can be administered to an 
individual by various routes including, for example, orally or parenterally, such as 
intravenously, intramuscularly, subcutaneously, intraorbitally, intracapsularly, 
intraperitoneally, intrarectally, intracisternally or by passive or facilitated absorption through 
the skin using, for example, a skin patch or transdermal iontophoresis, respectively. 
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Furthermore, the pharmaceutical composition can be administered by injection, intubation, 
orally or topically, the latter of which can be passive, for example, by direct application of 
an ointment, or active, for example, using a nasal spray or inhalant, in which case one 
component of the composition is an appropriate propellant. Furthermore, the agent can be 
delivered by intrathecal administration using a pump to administer the agent over a period 
of time. 

[0300] A pharmaceutical composition also can be administered to the site of a pathologic 
condition, for example, intravenously or intra-arterially into a blood vessel supplying a 
tumor, or by direct injection into the central nervous system., or a portion thereof such as 
the spinal cord. In aspects of the invention wherein the agent is intended to be delivered to 
the spinal cord, the agent can be an agent that is capable of crossing into the spinal cord 
from the blood stream. Such agents include anti-oxidant flavonoids discussed herein. 

[0301] The total amount of an agent to be administered in practicing a method of the 
invention can be administered to a subject as a single dose, either as a bolus or by infusion 
over a relatively short period of time, or can be administered using a fractionated treatment 
protocol, in which multiple doses are administered over a prolonged period of time. One 
skilled in the art would know that the amount of the pharmaceutical composition to treat a 
pathologic condition in a subject depends on many factors including the age and general 
health of the subject as well as the route of administration and the number of treatments to 
be administered. In view of these factors, the skilled artisan would adjust the particular 
dose as necessary. In general, the formulation of the pharmaceutical composition and the 
routes and frequency of administration are determined, initially, using Phase I and Phase II 
clinical trials. 

[0302] In general, in methods of the present invention, an agent is administered in an 
amount that is sufficient to modulate axonal guidance regulatory activity, monooxygenase 
activity, or plexin-interacting activity. It will be recognized that routine methods can be 
used to identify effective amounts. 

[0303] The pharmaceutical composition can be formulated for oral formulation, such as 
a tablet, or a solution or suspension form; or can comprise an admixture with an organic or 
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inorganic carrier or excipient suitable for enteral or parenteral applications, and can be 
compounded, for example, with the usual non-toxic, pharmaceutically acceptable carriers 
for tablets, pellets, capsules, suppositories, solutions, emulsions, suspensions, or other form 
suitable for use. The carriers, in addition to those disclosed above, can include glucose, 
lactose, mannose, gum acacia, gelatin, mannitol, starch paste, magnesium trisilicate, talc, 
com starch, keratin, colloidal silica, potato starch, urea, medium chain length triglycerides, 
dextrans, and other carriers suitable for use in manufacturing preparations, in solid, 
semisolid, or liquid form. In addition auxiliary, stabilizing, thickening or coloring agents 
and perfumes can be used, for example a stabilizing dry agent such as triulose (see, for 
example, U.S. Patent No. 5,314,695). 

[0304] In certain embodiments, the present invention provides detection methods. For 
example, in one embodiment, the present invention provides a method of detecting an 
immune disease or disorder in a subject. The method includes determining in the subject, 
the level of expression of a polynucleotide and/or polypeptide of the present invention, such 
as a MICAL polynucleotide or a MICAL polypeptide. An increased or a decreased level of 
expression or activity can be indicative of the immune disease or disorder. 

[0305] In another embodiment, the present invention provides a method of detecting 
cancer in a subject. The method includes determining in the subject, the level of expression 
of a polynucleotide and/or polypeptide of the present invention, such as a MICAL 
polynucleotide or a MICAL polypeptide. An increased level of expression or activity is 
indicative of the cancer. 

[0306] In another method, the present invention provides a method of assessing the 
invasiveness of cancer cells in a subject. The method includes determining in the subject, 
or, more particularly, in cancer cells from the subject, the level of expression of a 
polynucleotide and/or polypeptide of the present invention, such as a MICAL 
polynucleotide or a MICAL polypeptide. An increased level of expression or activity 
indicates that the cancer cells are invasive. 
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[0307] In another embodiment, the present invention provides a method of detecting 
central nervous system injury and/or peripheral nervous system injury in a subject, that 
includes determining the level of expression of a polynucleotide of the present invention, or 
a polypeptide of the present invention or a MICAL activity thereof, in a sample from the 
central nervous system or peripheral nervous system of the subject. It has been found that 
MICAL expression is increased upon injury to the nervous system. Accordingly, an 
increased level of expression or activity identified by a method of the invention is indicative 
of the central nervous system injury and/or peripheral nervous system. 

[0308] A sample of the central nervous system can be obtained using known methods. 
The sample can include for example, spinal fluid or neurons from the spinal cord at a 
suspected site of injury. The subject can be a human. The increased level can be identified 
by comparing the determined level to a level of a subject not suspected of suffering from 
spinal cord injury. 

[0309] The MICAL activity can be any of the MICAL activities, including, for example, 
monooxygenase activity, axon guidance regulatory activity, plexin interacting activity, or 
binding to SH-3 domain-containing proteins. Methods for detecting these activity, some of 
which are provided herein, are known in the art. Furthermore, methods for determining the 
level of expression of a polynucleotide or a polypeptide of the present invention, examples 
of which are provided below, are known in the art. 

[0310] Cells obtained in the sample for any of the methods for detecting or assessing of 
the present invention can be contacted with a lysis buffer. The sample obtained can then be 
further processed, for example to isolate nucleic acids or polypeptides. 

[031 1] Nucleic can be isolated from the lysed cells and cellular material by any number 
of means well known in the art. For example, a number of commercial products are 
available for isolating polynucleotides, including but not limited to, TriReagent (Molecular 
Research Center, Inc, Cincinnati, OH ). The isolated polynucleotides can then be assayed 
for the presence of a polynucleotide that encodes a MICAL or MICAL-Like polypeptide. 
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[0312] Analyzing expression of a MICAL polypeptide or a nucleotide encoding a 
MICAL polypeptide includes any qualitative or quantitative method for detecting 
expression of a gene, many of which are known in the art. Non-limiting methods for 
analyzing polynucleotides and polypeptides are discussed below. 

[0313] The methods of analyzing expression of MICAL or a MICAL-Like polypeptide 
of the present invention can utilize a biochip, or other miniature high-throughput 
technology. The manufacture and use of biochips such as those involving bioarrays, are 
known in the art and commercially available (See e.g., bio arrays available from Sigma- 
Genosys (The Woodlands, TX); Affymetrix (Santa Clara, CA), and Full Moon Biosystems 
(Sunnyvale, CA)) (For reviews of Biochips and bioarrays see, e.g., Kallioniemi O.P., 
"Biochip technologies in cancer research/ 5 Ann Med, Mar; 33(2): 142-7 (2001); and Rudert 
F., "Genomics and proteomics tools for the clinic," Curr Opin. Mol. Ther., Dec;2(6):633-42 
(2000)). 

[0314] Such bioarrays can be analyzed using blotting techniques similar to those 
discussed below for conventional techniques of detecting polynucleotides and polypeptides. 
Other microfluidic devices and methods for analyzing gene expression can be used for the 
methods of the present invention. 

[0315] Quantitative measurement of expression levels using bioarrays is also known in 
the art, and typically involve a modified version of a traditional method for measuring 
expression as described herein. For example, such quantitation can be performed by 
measuring a phosphor image of a radioactive-labeled probe binding to a spot of a 
microarray, using a phospohor imager and imaging software. 

[0316] A method of the present invention in certain aspects, employs RNA, including 
messenger RNA (mRNA), isolated from a CNS sample. The RNA may be single stranded 
or double stranded. Enzymes and conditions optimal for reverse transcribing the template 
to DNA well known in the art can be used. Alternatively, the RNA can be subjected to 
RNAse protection assays. A DNA-RNA hybrid that contains one strand of each can also be 
used. A mixture of polynucleotides can also be employed, or the polynucleotides produced 
in a previous amplification reaction, using the same or different primers may be so used. In 
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the instance where the polynucleotide sequence is to he amplified the polynucleotide 
sequence may be a portion of a MICAL, or can be present initially as a discrete molecule, 
such that the specific sequence is the entire nucleic acid. It is not necessary that the 
sequence to be amplified be present initially in a pure form; it may be a minor fraction of a 
complex mixture. 

[03 17] In addition, RNAse protection assays can be used if RNA is the polynucleotide 
obtained from the sample. In this procedure, a labeled antisense RNA probe is hybridized 
to the complementary polynucleotide in the sample. The remaining unhybridized single- 
stranded probe is degraded by ribonuclease treatment. The hybridized, double stranded 
probe is protected from RNAse digestion. After an appropriate time, the products of the 
digestion reaction are collected and analyzed on a gel (see for example Ausubel et al, 
CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, section 4.7.1 (1987)). As used 
herein, "RNA probe" refers to a ribonucleotide capable of hybridizing to RNA in a sample 
of interest. Those skilled in the art will be able to identify and modify the RNAse 
protection assay specific to the polynucleotide to be measured, for example, probe 
specificity may be altered, hybridization temperatures, quantity of nucleic acid etc. 
Additionally, a number of commercial kits are available, for example, RiboQuantTM Multi- 
Probe RNAse Protection Assay System (Pharmingen, Inc., San Diego, CA). 

[0318] In another embodiment, the polynucleotide in the sample may be analyzed by a 
blotting procedure, typically a Northern blot procedure, as illustrated in the Examples 
herein. For blotting procedures polynucleotides are separated on a gel and then probed with 
a complementary polynucleotide to the sequence of interest. For example, RNA is 
separated on a gel transferred to nitrocellulose and probed with complementary DNA that is 
derived from a MICAL gene. The complementary probe may be labeled radioactively, 
chemically etc. Hybridization of the probe is indicative of the expression of the MICAL. 

[0319] Detection of a polynucleotide encoding a MICAL can be performed by standard 
methods such as size fractionating the nucleic acid. Methods of size fractionating the DNA 
and RNA are well known to those of skill in the art, such as by gel electrophoresis, 
including polyacrylamide gel electrophoresis (PAGE). For example, the gel may be a 
denaturing 7 M or 8 M urea-polyacrylamide-formamide gel. Size fractionating the nucleic 
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acid may also be accomplished by chromatographic methods known to those of skill in the 
art. 

[0320] The detection of polynucleotides may optionally be performed by using 
radioactively labeled probes. Any radioactive label may be employed which provides an 
adequate signal. Other labels include ligands, colored dyes, and fluorescent molecules, 
which can serve as a specific binding pair member for a labeled ligand, and the like. The 
labeled preparations are used to probe for a polynucleotide by the Southern or Northern 
hybridization techniques, for example. Nucleotides obtained from samples are transferred 
to filters that bind polynucleotides. After exposure to the labeled polynucleotide probe, 
which will hybridize to nucleotide fragments containing target nucleic acid sequences, the 
binding of the radioactive probe to target nucleic acid fragments is identified by 
autoradiography (see Genetic Engineering, 1 ed. Robert Williamson, Academic Press 
(1981), pp. 72-81). The particular hybridization technique is not essential to the invention. 
Hybridization techniques are well known or easily ascertained by one of ordinary skill in 
the art. As improvements are made in hybridization techniques, they can readily be applied 
in the method of the invention. 

[0321] Probes according to the present invention and used in a method of the present 
invention selectively hybridize to a polynucleotide encoding a MICAL polypeptide. In 
preferred aspects, the probes are spotted on a bio array using methods known in the art. 

[0322] The polynucleotides encoding a MICAL may be amplified before they are 
detected. The term "amplified" refers to the process of making multiple copies of the 
nucleic acid from a single polynucleotide molecule. The amplification of polynucleotides 
can be carried out in vitro by biochemical processes known to those of skill in the art. The 
amplification agent may be any compound or system that will function to accomplish the 
synthesis of primer extension products, including enzymes. Suitable enzymes for this 
purpose include, for example, E. coli DNA polymerase I, Taq polymerase, Klenow 
fragment of E. coli DNA polymerase I, T4 DNA polymerase, other available DNA 
polymerases, polymerase muteins, reverse transcriptase, ligase, and other enzymes, 
including heat-stable enzymes {i.e., those enzymes that perform primer extension after 
being subjected to temperatures sufficiently elevated to cause denaturation). Suitable 
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enzymes will facilitate combination of the nucleotides in the proper manner to form the 
primer extension products that are complementary to each mutant nucleotide strand. 
Generally, the synthesis will be initiated at the 3 '-end of each primer and proceed in the 5'- 
direction along the template strand, until synthesis terminates, producing molecules of 
different lengths. There may be amplification agents, however, that initiate synthesis at the 
5 '-end and proceed in the other direction, using the same process as described above. In 
any event, the method of the invention is not to be limited to the embodiments of 
amplification described herein. 

[0323] One method of in vitro amplification, which can be used according to this 
invention, is the polymerase chain reaction (PCR) described in U.S. Patent Nos. 4,683,202 
and 4,683,195. The term "polymerase chain reaction" refers to a method for amplifying a 
DNA base sequence using a heat-stable DNA polymerase and two oligonucleotide primers, 
one complementary to the (+)-strand at one end of the sequence to be amplified and the 
other complementary to the (-)-strand at the other end. Because the newly synthesized 
DNA strands can subsequently serve as additional templates for the same primer sequences, 
successive rounds of primer annealing, strand elongation, and dissociation produce rapid 
and highly specific amplification of the desired sequence. The polymerase chain reaction is 
used to detect the presence of polynucleotides encoding cytokines in the sample. Many 
polymerase chain methods are known to those of skill in the art and may be used in the 
method of the invention. For example, DNA can be subjected to 30 to 35 cycles of 
amplification in a thermocycler as follows: 95°C for 30 sec, 52° to 60°C for 1 min, and 
72°C for 1 min, with a final extension step of 72°C for 5 min. For another example, DNA 
can be subjected to 35 polymerase chain reaction cycles in a thermocycler at a denaturing 
temperature of 95°C for 30 sec, followed by varying annealing temperatures ranging from 
54-58°C for 1 min, an extension step at 70°C for 1 min and a final extension step at 70°C. 

[0324] The primers for use in amplifying the polynucleotides of the invention may be 
prepared using any suitable method, such as conventional phosphotriester and 
phosphodiester methods or automated embodiments thereof so long as the primers are 
capable of hybridizing to the polynucleotides of interest. One method for synthesizing 
oligonucleotides on a modified solid support is described in U.S. Patent No. 4,458,066. The 



WO 03/066821 



PCT/US03/03551 



101 

exact length of primer will depend on many factors, including temperature, buffer, and 
nucleotide composition. The primer must prime the synthesis of extension products in the 
presence of the inducing agent for amplification. 

[0325] Primers used according to the method of the invention are complementary to each 
strand of nucleotide sequence to be amplified. The term "complementary" means that the 
primers must hybridize with their respective strands under conditions, which allow the agent 
for polymerization to function. In other words, the primers that are complementary to the 
flanking sequences hybridize with the flanking sequences and permit amplification of the 
nucleotide sequence. Preferably, the 3' terminus of the primer that is extended has perfectly 
base paired complementarity with the complementary flanking strand. Primers and probes 
for polynucleotides encoding MICALs of the present invention, can be developed using 
known methods combined with the present disclosure. 

[0326] Those of ordinary skill in the art will know of various amplification 
methodologies that can also be utilized to increase the copy number of target nucleic acid. 
The polynucleotides detected in the method of the invention can be further evaluated, 
detected, cloned, sequenced, and the like, either in solution or after binding to a solid 
support, by any method usually applied to the detection of a specific nucleic acid sequence 
such as another polymerase chain reaction, oligomer restriction (Saiki et ah, \ 
Bio/Technology 3:1008-1012 (1985)), allele-specific oligonucleotide (ASO) probe analysis 
(Conner et ah, Proc. Natl Acad. Set USA 80; 278 (1983), oligonucleotide ligation assays 
(OLAs) (Landegren et al. 9 Science 241:1077 (1988)), RNAse Protection Assay and the like. 
Molecular techniques for DNA analysis have been reviewed (Landegren et al 9 Science 242: 
229-237 (1988)). Following DNA amplification, the reaction product may be detected by 
Southern blot analysis, without using radioactive probes. In such a process, for example, a 
small sample of DNA containing the polynucleotides obtained from the tissue or subject are 
amplified, and analyzed via a Southern blotting technique. The use of non-radioactive 
probes or labels is facilitated by the high level of the amplified signal. In one embodiment 
of the invention, one nucleoside triphosphate is radioactively labeled, thereby allowing 
direct visualization of the amplification product by autoradiography. In another 
embodiment, amplification primers are fluorescently labeled and run through an 
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electrophoresis system. Visualization of amplified products is by laser detection followed 
by computer assisted graphic display, without a radioactive signal. 

[0327] The methods of the present invention can involve a real-time quantitative PCR 
assay, such as a Taqman® assay (Holland et al., Proc. Natl Acad. Sci. USA, 88(16):7276 
(1991)). The assays can be performed on an instrument designed to perform such assays, 
for example those available from Applied Biosystems (Foster City, CA). Primers and 
probes for such an assay can be designed according to known procedures in the art. 

[0328] Simple visualization of a gel containing the separated products may be utilized to 
analyze polynucleotides encoding MICALs according to the methods of the present 
invention. For example, staining of a gel to visualize separated polynucleotides, a number 
of stains are well known to those skilled in the art. However, other methods known to those 
skilled in the art may also be used, for example scanning densitometry, computer aided 
scanning and quantitation as well as others. 

[0329] The method for detecting MICAL expression can alternatively employ the 
detection of a polypeptide product of one of these genes. The method for detecting a 
MICAL polypeptide in a cell is useful for detecting spinal cord injury by measuring the 
level of the MICAL polypeptide, in cells obtained from a subject suspected of having, or at 
risk of having spinal cord injury. The levels of MICALs are indicative of spinal cord injury 
when compared to a MICAL levels in a subject without spinal cord injury 

[0330] In this regard, the sample, as described herein, can be used as a source to isolate 
polypeptides. The MICAL polypeptide can then be quantified using methods known to 
those of skill in the art, for example by ELISA. 

[0331] Monoclonal antibodies to a particular polypeptide can be used in immunoassays, 
such as in liquid phase or bound to a solid phase carrier, to detect the MICAL polypeptide 
In addition, the monoclonal antibodies in these immunoassays can be detectably labeled in 
various ways. Examples of types of immunoassays that can utilize monoclonal antibodies 
of the invention are competitive and non-competitive immunoassays in either a direct or 
indirect format. Examples of such immunoassays are the radioimmunoassay (RIA) and the 
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sandwich (immunometxic) assay. Detection of the polypeptide antigens using the 
monoclonal antibodies of the invention can be done utilizing immunoassays, which are run 
in either the forward, reverse, or simultaneous modes, including immunohistochemical 
assays on physiological samples. Those of skill in the art will know, or can readily discern, 
other immunoassay formats without undue experimentation. In addition, there are a number 
of commercially available antibodies to cytokines of interest. 

[0332] The term "immunometric assay or "sandwich immunoassay" includes 
simultaneous sandwich, forward sandwich and reverse sandwich immunoassays. These 
terms are well understood by those skilled in the art. Those of skill will also appreciate that 
antibodies according to the present invention will be useful in other variations and forms of 
assays which are presently known or which may be developed in the future. These are 
intended to be included within the scope of the present invention. 

[0333] Monoclonal antibodies can be bound to many different carriers and used to detect 
the presence of a MICAL polypeptide. Examples of well-known carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and modified 
celluloses, polyacrylamides, agaroses and magnetite. The nature of the carrier can be either 
soluble or insoluble for purposes of the invention. Those skilled in the art will know of 
other suitable carriers for binding monoclonal antibodies, or will be able to ascertain such 
using routine experimentation. 

[0334] In performing the assays it may be desirable to include certain "blockers" in the 
incubation medium (usually added with the labeled soluble antibody). The "blockers" are 
added to assure that non-specific proteins, proteases, or anti-heterophilic immunoglobulins 
to anti-cytokine immunoglobulins present in the experimental sample do not cross-link or 
destroy the antibodies on the solid phase support, or the radiolabeled indicator antibody, to 
yield false positive or false negative results. The selection of "blockers" therefore may add 
substantially to the specificity of the assays. 

[0335] Alternatively, the level of MICAL protein or nucleic acid can be determined in 
vivo using known imaging techniques. For example, an anti-MICAL antibody can be 
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labeled with a radioactive marker whose presence and location in a subject can be detected 
by standard imaging techniques. 

[0336] The results present herein reveal a series of embodiments directed at methods of 
treating neurological disorders involving a failure of axon regrowth and methods for 
inducing regrowth of injured processes of neurons that include altering the oxidative state of 
an affected cell. The present disclosure identifies a novel gene family, the MICALs (also 
referred to as the zeyphyrins, and the 151 family), whose protein domains suggest a novel 
means by which nerve growth is regulated. MICALs are characterized by a flavoprotein 
monooxygenases region (i.e., oxidoreducatase family) that has not been previously shown 
to function in axon guidance. The presence and necessity of this flavoprotein 
monooxygenase domain indicates that this gene family regulates repulsive axon guidance 
through a novel means-oxidation/reduction (redox) mechanisms. The present disclosure 
indicates that MICALs use redox mechanisms to regulate axon growth by directly or 
indirectly destabilizing the cellular machinery (i.e., the actin cytoskeleton) necessary for 
axon outgrowth. 

[0337] The discovery of MICALs and their mechanism of action (i.e., redox 
mechanisms) illuminates a novel general means through which widespread inhibition of 
axon growth can occur-actin oxidation. In particular, the inability of axons to regrow after 
spinal cord injury may be a consequence of the presence of high amounts of reactive oxygen 
species and other oxidative mechanisms in the spinal cord milieu after injury. These 
oxidants may directly alter the structure of the actin cytoskeleton-in effect, acting non- 
specifically like members of the MICAL protein family. In total, these discoveries indicate 
novel and immediate treatments for many neurological disorders, targeting both general 
oxidants as well as the MICALs. In particular, these novel treatments include novel 
therapeutic strategies (e.g., antioxidants and other redox active compounds) and novel 
agents (e.g., EGCG, EC, and other flavonoids and antioxidants) to promote axonal regrowth 
(e.g., following spinal cord injury and similar neurological disorders) as well as novel 
strategies (e.g., oxidants) to limit abnormal and excessive axonal growth (e.g., following 
certain neuropathies, and increased sensitizations). In summary, the discoveries disclosed 
herein, implicate oxidation mechanisms in limiting axon growth and preventing axon 
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regeneration in general; these mechanisms have not previously been suggested or shown to 
be involved in limiting axon outgrowth after spinal cord injury or after other neurological 
disorders. 

[0338] Accordingly, in another aspect, the present invention provides a method for 
treating a neurological disorder involving a failure of axon regrowth, comprising contacting 
a neuron having axons that fail to regrow, or surrounding tissue, with an agent that 
neutralizes oxidants, thereby treating the neurological disorder. The surrounding tissue can 
include any tissue whose components, typically cells, can produce factors that affect axonal 
growth. 

[0339] The agent that neutralizes oxidants can include virtually any anti-oxidant. 
Especially preferred are anti-oxidants that can be delivered orally and that can enter the 
central nervous system or peripheral nervous system. The agent for example, can be 
applied directly to a neuron having axons that fail to regrow. For example, the agent can be 
directly injected to the site of spinal cord injury. A method according to this aspect of the 
invention can be performed in vitro or in vivo. 

[0340] For example, anti-oxidarit vitamins can be used. These vitamins include vitamins 
E, C and beta carotene. Other useful antioxidants for the present invention include, for 
example, methylprednisolone, Tirilazad, lazaroids (2 1 -aminosteroids) and similar steroids, 
alpha tocophenol, lycopene, gamma tocophenol, mannitol, catalase, and glutathione, 
superoxide dismutase. Also useful for the present invention is the anti-oxidant compound H 
290/5 (See e.g., Thomwall M., et al., Acta Neurochir Suppl (Wien) 70:212-5 (1997)), and 
the anti-oxidant AM-36 (Callaway, J. K., X Alzheimers Dis. 2(2): 69-78 (2000)). 

[0341] As indicated herein, monooxygenase inhibitors and anti-oxidant flavonoids, 
including those that are monooxygenase inhibitors such as ECGC and EC, can be used as 
anti-oxidants for this embodiment and embodiments aimed at inducing regrowth of an 
injured process of a neuron. The anti-oxidant flavonoid can be a gallic acid derivative such 
as ECGC or EC. Examples of other gallic acid derivatives that can be used to affect axonal 
guidance regulatory activity in embodiments of the present invention include, but are not 
limited to, (-)-gallocatechin-3-(9-gallate (GCG), (-)-epicatechin-3-0-gallate (ECG), (-)- 
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epigallocatechin (EGC), (+)-gallocatechin (GC), theasinensin A, 3"-0-methyl-EGCG, 3"- 
O-methyl-ECG, 3"-0-methyl-GCG, (-)-epigallocatechin (EGC), (l)-gallocatechin (GC), 
gallic acid, catechin, n-octyl gallate, and n-cetyl gallate. 

[0342] In another embodiment, the present invention provides a method for inducing 
regrowth and/or preventing inhibition of an injured process of a neuron, that includes 
altering the levels of reactive oxygen species in the milieu of the neuron. The method can 
include identifying a site that includes the neuron suspected of having an injured process, 
before altering the levels of reactive oxygen species or other oxidation products in the 
milieu of the neuron. The neuronal process can be an axon or a dendrite. The levels of 
reactive oxygen species or other oxidation products are typically decreased by the method. 

[0343] In certain aspects, levels of reactive oxygen species are altered chronically, as 
discussed herein for application of an agent. For example, the levels can be altered for a 
period of time that is sufficient to permit axon regrowth (i.e. neurorestoration) and the 
establishment of synaptic connections with new targets. This chronic alteration of the level 
of reactive oxygen species, in certain aspects is for at least 1, 2, 7, or 14 days, or 1, 2, 3, 4, 
5, 6, 12, 24, 36, 48, or 60 months after identification or suspicion of the injured process of 
the neuron. 

[0344] In one aspect, oxygen species can be decreased chronically by delivery of a 
recombinant cell that expresses a recombinant enzyme that lowers reactive oxygen species 
to a site of neurological damage or other site in need of regrowth of neural processes. Such 
recombinant enzymes include, for example, catalase and superoxide dismutase. 
Alternatively, enzymes that lower reactive oxygen species can be delivered to a site in need 
of regrowth of neural processes. 

[0345] The method can further include adding an agent that promote neuron process 
regrowth, such as a neurotrophic factor or a neural stem cell, as discussed above, to the 
milieu of the neuron. The milieu of the neuron includes fluids, molecules, and tissues that 
surround a neuron. As discussed hereinabove, the agent that promotes neuron process 
regrowth can be, for example, a neurotrophin, a mechanical bridge, or a stem cell. 
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[0346] In another embodiment, the present invention provides a method for limiting 
abnormal axon outgrowth, that includes contacting a neuron or the milieu of the neuron with 
an agent that affects oxidation state. The abnormal axon outgrowth can be excessive axon 
outgrowth. 

[0347] In another embodiment, the present invention provides a method for improving 
sperm function, that includes contacting a sperm cell, or progenitor thereof, with an 
antioxidant agent in an amount sufficient to modulate MICAL activity. In certain aspects, 
the method includes reducing levels of reactive oxygen species in the milieu of the sperm or 
progenitor thereof. Human spermatozoa exhibit a capacity to generate ROS and initiate 
peroxidation of the unsaturated fatty acids in the sperm plasma membrane, which plays a 
key role in the etiology of male infertility (Sharma R. K., and Agarwal A., Urology, 48:835 
(1996)). Accordingly, MICAL expression can be involved in sperm malfunction through 
oxidation of components of sperm cells. Therefore, agents such as antioxidants, for 
example, antioxidant flavonoids, and particularly monooxygenase inhibitors, such as those 
disclosed herein can be used to improve sperm function and to treat male infertility. 

[0348] In another embodiment, the present invention provides a method for modulating 
cardiac development in a subject, for example a human subject such as a patient in need of 
the method. The method includes contacting a cardiac neural crest cell with an amount of 
an agent that modulates MICAL activity, the amount being effective to modulate cardiac 
development. 

[0349] In another embodiment, the present invention provides a method for treating, 
managing, and/or ameliorating the symptoms of a cardiovascular disease in a human 
subject. The method includes contacting a cardiac cell in the human subject with an amount 
of an agent that modulates MICAL activity. The amount is effective to treat, manage, 
and/or ameliorate the symptoms of the cardiovascular disease. The agent can modulate any 
of the activities of a MICAL included, for example, axon guidance regulatory activity, 
plexin interacting activity, actin binding, and/or monooxygenase activity. 



[0350] In another embodiment, the present invention provides a method for modulating 
an immune response in a human subject in need thereof. The method includes contacting an 
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immune cell in the human subject with an amount of an agent that modulates MICAL 
activity. The amount is effective for modulating said immune response. In certain aspects, 
the immune response is inflammation. MICALs through their involvement in semaphorin- 
mediated pathways, are predicted to be involved in semaphorin-mediated processes of the 
immune system. For example, using a differential display technique, upregulation of . 
semaphorin E in rheumatic synovial fibroblasts has been observed. Accordingly, in certain 
aspects, the human subject for the method for modulating an immune response has 
rheumatoid arthritis. In other embodiments, the human subject for the method of 
modulating an immune response has another inflammatory disease, such as, but not limited 
to, asthma, encephilitis, inflammatory bowel disease, chronic obstructive pulmonary disease 
(COPD), allergic disorders, septic shock, pulmonary fibrosis, undifferentiated 
spondyloarthropathy, undifferentiated arthropathy, inflammatory osteolysis, and chronic 
inflammation resulting from chronic viral or bacteria infections. 

[0351] In another embodiment, the present invention provides a method for inhibiting 
cancer cell proliferation or metastasis in a human subject. The method includes contacting a 
cancer cell in the human subject with an amount of an agent that modulates MICAL 
activity. The amount is effective to inhibit MICAL axon guidance regulatory activity, 
monooxygenase activity, or plexin-interacting activity, thereby being effective to inhibit 
proliferation or metastasis of the cancer cell. As discussed herein, MICALs through their 
involvement in semphorin-mediated pathways are predicted to be involved in semaphorin- 
mediated processes in cancer cells, including metastatic cancer cells. 

[0352] In another aspect, the present invention includes kits that are useful for carrying 
out the methods of the present invention. The components contained in the kit depend on a 
number of factors, including the type of method being carried out. 

[0353] Accordingly, the present invention provides a kit for modulating the activity of a 
MICAL polypeptide. The kit includes an agent in an effective amount and formulation to 
be effectively delivered to a subject. The agent can be any of the agents disclosed herein. 
In certain aspects, the agent is a monooxygenase inhibitor, such as a flavonoid, for example 
a gallic acid derivative. The gallic acid derivative in certain aspects is ECGC or EC. The 
kit also includes instructions, either as a pamphlet provided with the kit, or in an on-line site 
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that provides instructions, for performing the method for modulating activity of the MICAL 
polypeptide. 

[0354] In another aspect of a kit embodiment, the present invention provides one or 
more containers that include a MICAL or MICAL-Like polypeptide or polynucleotide of 
the present invention, a vector that includes the MICAL or MICAL-like polypeptide 
operably linked to a heterologous promoter, and/or a recombinant cell that includes the 
vector. The kits can include instructions for performing any of the methods provided 
herein. 

[0355] In another aspect, the kit can provide a container that includes a MICAL 
detection molecule. A MICAL detection molecule is for example, an antibody, an 
oligonucleotide probe, or any of the other known types of molecules that can be used to 
detect expression or activity of MICAL, as disclosed herein. The kit in certain aspects 
includes an oligonucleotide probe, primer, or primer pair, or combination thereof for 
carrying out a detection method of the present invention, as discussed above. For example, 
the probe, primer, or primer pair, can be capable of selectively hybridizing to a MICAL 
polynucleotide. The kit can further include one or more detectable labels. 

[0356] The following examples are intended to illustrate but not limit the invention. 

EXAMPLE 1 

MICAL is a Large, Cytosolic, Multidomain Protein that Interacts 

with Drosophila Plexin A 

[0357] This example illustrates that Drosophila MICAL is a multi-domain protein that 
interacts with Plexin A. 

Yeast Two-Hybrid Screening 

[0358] Yeast protocols were conducted using standard techniques (Golemis et al., 1994). 
Portions of the intracellular domains of PlexA (amino acids 1702-1945; EST LD13083), 
PlexB (amino acids 1785-2051; EST CK00213), and the corresponding intracellular regions 
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of human Plexin A3, and mouse Plexin A4 (gifts of L. Tamagnone, and H. Fujisawa, 
respectively) were inserted into the yeast bait vector, as described in more detail as follows: 

[0359] The terminal "C2" portion of the PlexA cytoplasmic domain (amino acids 1702- 
1945), which is highly conserved among all plexin family members, was used to search for 
interacting proteins encoded by & Drosophila embryonic (0-24 hrs.) cDNA library. The 
PCR-amplified PlexA C2 domain (the bait) was inserted into the yeast expression vector 
pEG202 (bait vector). Following sequencing of both strands, the bait was introduced into 
the yeast strain EGY48 containing the p-galactosidase expressing plasmid pJK103. Western 
analysis of transformed yeast using an antibody to LexA (Invitrogen) confirmed appropriate 
sized expression and an activation assay showed that the bait could not activate 
transcription on its own . A 0-24 hr. Drosophila embryonic cDNA library was cloned into 
the yeast expression vector pJG4-5 (generated by H. Araj). > 2 X 10 6 clones were screened 
and interactions assessed with a visual p -galactosidase assay and a test of growth in the 
absence of leucine. Yeast clones exhibiting varying degrees of interaction were selected 
and standard protocols were used to recover the library vector and sequence these clones on 
both strands. 

[0360] cDNAs containing the C-terminal of human MICAL-1 (DFkzp434B517) and 
mouse MICAL-2 (BB481898) were used to amplify portions homologous to the last 200 
amino acids of Drosophila MICAL and cloned into the library vector. 

Molecular Analysis 

[0361] Proteins, domains, and alignments were identified using Web-based protein 
domain searching and alignment tools (PFAM, BLAST, PRINTS, JALVIEW, and 
ClustalX) and our own molecular analysis. Human MICAL-1 (EST FLJ1 1937), Human 
MICAL-2 (ESTs BF815128, KIAA1364, KIAA0819), and Human MICAL-3 (ESTs 
KIAA0750, and FLJ14966) were identified by BLAST searches on publicly available 
cDNA and genomic sequence and in some cases overlapping ESTs were assembled 
virtually. Drosophila MICAL-L (EST LD45758) and human MICAL-L1 (EST 
XM001070) and MICAL-L2 (ESTS FL00139 and FLJ23471) were identified by searching 
publicly available cDNA and genomic sequence databases. 
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RESULTS 

[0362] To identify mediators of semaphorin-dependent repulsive axonal guidance the 
terminal highly conserved "C2" portion of the PlexA cytoplasmic domain was used to 
search for interacting proteins encoded by a Drosophila embryonic (0-24 hrs.) yeast two- 
hybrid cDNA library (Figures 5A-C). Fifty-two interactors encoded by cDNAs derived 
from four different genes were identified. Over half of these interactors were encoded by a 
single gene and two overlapping cDNAs encoded by this gene were selected for further 
study (clones 23 and 151). 

[0363] Yeast interactions using the C2 domain of PlexA as the bait (Plexin A C2) were 
assessed and the strongest interactors (e.g., clones 23, and 151), as determined by a p- 
galactosidase assay (Beta Gal Activity), were derived from MICAL (Fig. 5B). Further, the 
bait construct (Plexin A C2) was cloned into the library vector and clone 151 was inserted 
into the bait vector and an interaction assay demonstrated the vector independence of these 
interactions . Clones 23 and 15 1 do not interact with the C2 domain of Drosophila PlexB. 
Though GOF experiments suggest that PlexB functions like PlexA to signal semaphorin- 
mediated motor axon repulsion (Hu et al., 2001), we have thus far not observed associations 
between MICAL and PlexB proteins. The C2 domains of human Plexin A3 (HPlexin A3 
C2) and mouse Plexin A4 (MPlexin A4 C2) interact strongly with the plexin interacting 
regions (PIR) of human MICAL- 1 (HMICAL1 PIR) and mouse MICAL-2 (MMICAL2 
PIR), respectively. 

[0364] DNA sequence analysis suggested that the overlapping cDNA clones we 
identified in our yeast screen did not encode a full-length gene product. These cDNAs have 
an open reading frame (ORF) at their 5' ends and a stop codon near their 3' ends, indicating 
we had identified the C terminal 255 amino acids of this novel protein. Northern analysis 
using standard techniques (Sambrook et al, 1989) on 0-24 hr Drosophila embryonic total 
RNA and a portion of clone 151 as a probe showed that the full-length transcript from this 
gene is greater than 10 kilobases (Kb) in length (Clone 151, 3' MICAL) (Fig. 5C). Given a 
lack of publicly available expressed sequence tags (ESTs) extending our cDNA further 5', 
we used one of our initial cDNA clones (clone 23; Figure ID) to screen a Drosophila 
embryonic lambda gtl 1 phage cDNA library (a generous gift from K. Zinn) for full-length 
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cDNAs using standard techniques (Sambrook et al., 1989). The longest clones were 
selected and sequenced on both strands. We were unable to identify a single full-length 
transcript, so we conducted an extended cDNA walk to obtain full-length MICAL 
transcripts. Isolated cDNAs were assembled to identify full-length MICAL isoforms. 
Northern analysis using a probe derived from the 5 5 end of an assembled full-length cDNA 
detected a large transcript of greater than lOkb (5' MICAL). This transcript is similar in 
size to the transcript detected with a probe from the 3' portion of this assembled cDNA, 
providing further evidence that the 3' and 5' portions of the assembled full-length cDNAs 
are from the same transcript or group of large transcripts. 

[0365] The genomic organization of the MICAL locus was determined using the 
Sequencer 2.1 program (Gene Codes Corp.), the identified cDNAs, and publicly available 
Drosophila genomic DNA sequences. This extensive molecular analysis demonstrated that 
the Drosophila gene defined by clones 23 and 151, is Drosophila MICAL, and covers 
>41kb of genomic sequence and has at least 25 exons (Figure 1 A; see Figure 5). Based on 
analysis of isolated cDNAs and western analysis (see Figure 6D), there are at least three 
MICAL isoforms ("long," "medium," and "short" variants; Figure 1 A). 

[0366] At the MICAL C-terminus is the plexin interacting region that was identified in 
the yeast screen. Within the plexin interacting region there is a predicted heptad-repeat, 
coiled-coil structure (Figure IB), a motif thought to be involved in protein-protein 
interactions (Burkhard et al., 2001). Interestingly, this region of MICAL shares amino acid 
similarity with several other coiled-coil domain-containing proteins including a portion of 
the alpha domain found in the Ezrin, Radixin, and Moesin (ERM) proteins (-22% identity; 
Bretscher et al., 2000). The last four amino acids of MICAL (ESII) are a PDZ protein 
binding motif (Harris and Lim, 2001). N-terminal to the plexin interacting region of 
MICAL there is a proline rich region. MICAL has two regions of varying length, variable 
regions (1) and (2), which have no significant similarity to any other proteins and which 
appear to determine the size of the different MICAL proteins (Figure IB). MICAL has a 
single LIM domain (Figure IB), a protein-protein interaction module found in a variety of 
proteins involved in signal transduction cascades and in cytoskeletal organization (Bach, 
2000), and also a single calponin homology (CH) domain (Figure IB), a domain also found 
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in cytoskeletal and signal transduction proteins and known to be involved in actin filament 
binding (Gimona et al., 2002). The MICAL N-terminal -500 amino acid domain is highly 
conserved among MICAL-related proteins (see below), but is unique over its entire length 
in comparison to other proteins. 

EXAMPLE 2 

MICAL IS EXPRESSED ON DROSOPHILA EMBRYONIC MOTOR AND 
CNS AXONS AND COIMMUNOPRECIPITATES WITH PLEX A 

[0367] This example illustrates that MICAL is expressed in axons and that MICAL 
interacts with PlexA. 

In Situ Hybridization 

[0368] RNA in situ analysis of whole-mount Drosophila embryos and cryosections of 
El 5 and El 8 rat spinal cords were as described (Kolodkin et al., 1993; Pasterkamp et al., 
1998). 

Development of HA-PlexA transgenic flies 

[0369] The HA-PlexA construct was created by inserting in the correct orientation an in- 
frame PGR amplified HA sequence into the EcoRI site that links the artificial signal 
sequence and the extracellular domain of PlexA in a PlexA pSectag B construct generously 
provided by C. Goodman. Following sequencing of the insert, the entire HA-PlexA cDNA 
was inserted into the pUAST vector; one transgenic fly was obtained. 

MICAL Antibody Generation, Western Analysis, Immunohistochemistry, and 
Immunoprecipitation 

[0370] Antibodies were generated and characterized as described (Yu et al, 1998). 
cDNAs corresponding to the last 359 amino acid of MICAL (MICAL-CT antibody) were 
inserted into the pTrcHisA vector (Invitrogen). MICAL-CT antibodies were used for 
Western analysis at a 1:2000 dilution, and on Drosophila embryos at 1:3000 dilution. 
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[0371] Embryos generated by crossing UAS-HAPlexA and Elav-GAL4/Cyo adults were 
collected, and co-immunoprecipitations were performed using standard techniques, and an 
HA monoclonal antibody (12CA5; Roche). Western analysis was performed using an HA 
antibody (1 :3000, rat mAb Clone 3F10, Roche), our MICAL-CT antibody (1 :2000), and an 
Enabled antibody (1:500; IG6C10; gift from D. Van Vactor). 

Immunoprecipitations 

[0372] Embryos generated by crossing UAS-HAPlexA and Elav-GAL4/Cyo adults were 
collected, dechlorinated with 50% bleach, and 100-200 mg of embryos were lysed in either 
1) lmL of RIPA buffer (150mM NAC1, 1.0% NP-40, 0.5% deoxycholate, 0.1% SDS, 
50mM Tris, pH 8.0, 0.2mM NaV0 4 , lOmM NaF, and protease inhibitor cocktail (Sigma) 
and 20ug/mL PMSF), 2) lmL of 1% Triton buffer (150mM NaCl; 50mM Tris-HCl, pH 
8.0), or 3) 1% NP-40 (150mM NaCl, 5mM Tris-HCl, pH 8.0) using a tight 2mL Dounce 
homogenizer at 4°C. Similar results were observed with each buffer. Extracts were cleared 
by ultra centrifugation at 100,000xg for 15 min. at 4°C and added to 50^L of a 50% slurry 
of Gammabind G beads (Amersham) for 30min with rocking at 4°C. Lysates with beads 
were then centrifuged for 30 minutes and supernatants were immunoprecipitated for 30 
minutes with 2|*L anti-HA per sample (mouse monoclonal antibody (Clone 12CA5, Roche). 
100(j.L of Gammabind G beads were then added to the sample, and the samples were 
incubated for 90 min. at 4°C with rocking, washed 6 times with lysis buffer and resuspended 
in 50|aL of Laemmli loading buffer. Western analysis was performed using an HA antibody 
(1 :3000, rat mAb Clone 3F10, Roche), our MICAL-CT antibody (1 :2000), and an Enabled 
antibody (1:500; IG6C10; a generous gift from D. Van Vactor). 

RESULTS 

[0373] In situ hybridization analysis using RNA probes corresponding to the N- or C- 
terminal of MICAL shows that MICAL and PlexA have similar patterns of embryonic 
mRNA expression. During early Drosophila development (stages 7-8), both MICAL and 
PlexA are expressed in the ventral neurogenic region and in many non-neuronal tissues 
(including developing mesoderm, cells surrounding the cephalic furrow and 
amnioproctodeal invagination, and in gut primordia). This non-neuronal expression is also 
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seen later in embryonic development (stages 1 1-17), where both MICAL and PlexA are 
present within the anterior and posterior midgut primordia, the visceral musculature, and 
weakly in somatic musculature. During axonal pathfinding (stage 13 onward) both MICAL 
and PlexA are expressed within the developing brain and ventral nerve cord in most, if not 
all, CNS neurons but MICAL, like Semala and PlexA, is not highly expressed in peripheral 
sensory neurons. 

[0374] Western blot analysis using a polyclonal antibody directed against the MICAL C- 
terminus (MICAL-CT) revealed prominent bands at 530kD, 330kD, 300kD, 200kDa, and 
125kDa in lysates from wild-type embryos which are seen at greater intensity in lysates 
from embryos harboring a chromosomal duplication that includes the MICAL locus 
(see Figure 6). The three largest protein bands are in agreement with the molecular weights 
of the three MICAL isoforms predicted from our analysis of MICAL cDNAs (Figures 1A). 
MICAL immunoreactivity was not observed in embryonic lysates obtained from mutant 
embryos harboring a deficiency which includes the MICAL locus (see below), showing 
these products identified by Western analysis are derived from MICAL and that our 
antibodies are MICAL-specific (see Figure 6D). 

[0375] MICAL protein is present in neuronal cell bodies, along axons, and in growth 
cones. MICAL immunostaining first appears in the nervous system at stage 13 and labels 
motor and CNS projections. At later embryonic stages, MICAL immunostaining is present 
on axons that make up all motor axon pathways: the intersegmental nerve (ISN); the 
intersegmental nerves (ISNb and ISNd); and the segmental nerves a and c (SNa, and SNc) . 
MICAL immunostaining is also present in segment boundaries at the position of muscle 
attachment sites and at low levels in the lateral cluster of chordotonal organs. 

[0376] To ask whether PlexA and MICAL directly interact in neurons, transgenic flies 
were generated that contain a transgene encoding epitope-tagged PlexA (HA-PlexA) under 
the control of an upstream activator sequence (UAS) (Brand and Perrimon, 1993) and 
crossed with flies that express the GAL4 transcription factor in all neurons (Elav-GAL4). 
Lysates from embryos containing both HA-PlexA and Elav-GAL4 elements were subjected 
to immunoprecipitation using HA antibodies and then Western blotting with MICAL-CT 
antibodies. Robust co-immunoprecipitation of MICAL was observed using HA antibodies 
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and also reciprocal co-immunoprecipitation of HA-PlexA using MICAL-CT antibodies . 
The "large" MICAL isoform is the predominant variant observed to be associated with 
neuronally-expressed HA-PlexA, which may reflect tissue-specific expression of this 
isoform in neurons. MICAL co-immunoprecipitation appears specific since enabled (Ena), 
a neuronally expressed cytosolic protein, was not co-immunoprecipitated by HA antibodies 
in similar experiments and Unc5, a neuronally-expressed transmembrane receptor was not 
co-immunoprecipitated by MICAL-CT antibodies . 

EXAMPLE 3 

A MICAL LOSS-OF-FUNCTION MUTANT DEMONSTRATES THAT MICAL IS 
REQUIRED FOR MOTOR AXON PATHFINDING 

[0377] This example illustrates that MICAL is required for motor axon pathfinding. 

Drosophila Genetics and Phenotypic Characterization 

[0378] Drosophila genetics, transformations, and preparation and analyses of 
Drosophila embryos was performed as described (Winberg et al., 1998b; Yu et al., 1998). 
The cytological location of MICAL was determined by hybridizing a radiolabeled cDNA 
probe corresponding to either the 5' or the 3' regions of the MICAL ORF on a Drosophila 
genomic PI clone fdter (Genome Systems) and following the manufacturer's instructions. 
MICAL is located on the third chromosome of Drosophila in the 85F3-6 chromosomal 
location . Unfortunately, this region was devoid of any small, publicly available, 
deficiencies and candidate MICAL mutations. 

[0379] To generate a MICAL LOF mutant we identified two P transposable elements 
closely flanking the MICAL locus and used a P element transposase-mediated mutagenesis 
strategy to delete the region between these P elements (Cook et al., 2001; Cooley et al., 
1990; Preston et al., 1996). A search of public databases revealed two transposable 
elements that flanked the MICAL locus and were separated by -165 kbs. Our molecular 
analysis confirmed that one Q(3)s2681; Bloomington Stock Center) was located in a 
separate gene <3kb from the putative 5' end of MICAL and the other (EP(3)3681; Berkeley 
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Drosophila Genome Project) was -120 kb from the MICAL 3' end . A third P element 
(1(3)1 0477) was situated between these two elements and served as a genetic marker. Our 
molecular analysis shows that 1(3)10477 (Bloomington Stock Center) is situated in a novel 
gene ~70kb from the 3' end of MICAL, between l(3)s2681 and EP(3)3681. 1(3)10477 adult 
flies hold their wings out-stretched at a 45° angle from their bodies. Molecular analysis, 
genetic complementation analysis, and identification of additional 1(3)10477 alleles 
(including on the TM3 balancer) show that the wing phenotype associated with 1(3)10477 is 
recessive and due to a mutation of a gene located ~70kb from the 3' end of MICAL (Terman 
and Kolodkin, unpublished). We have called this new gene stretched out (stretch) and used 
it as a genetic marker to identify MICAL deletions (submitted to Flybase). (1) Using 
standard genetic techniques, we generated adult flies containing each starting P element in 
trans (EP(3)3681/l(3)s2681). (2) A source of P element transposase was introduced as a 
mutagen and screened for the appearance of adult progeny containing a stretch wing 
phenotype (Screened through -25,000 flies). 

[0380] 80 adults were identified with a stretch-like wing phenotype and individual fly 
stocks were made for each. Each stock was then re-scored for the stretch phenotype. Five 
candidate MICAL deficiency lines were identified and the extent of the deletion was mapped 
using standard techniques by asking whether these lines complemented ("+") or failed to 
complement ("-") (i.e., showed lethality or the wing phenotype) when crossed to flies 
containing chromosomal aberrations flanking the MICAL locus. The fly stocks used in the 
mapping (complementation analysis) experiments were as follows: Df(3R)byl0 
(85D8;85E10-13; Bloomington Stock Center), Df(3R)by62 (85D11-14;85F6, 041h; 
85F6(T); Bloomington Stock Center), Df(3R)segG16 (segregant Dp(2;2)G16[ZD]TE35B- 
3[2P]; breakpoints=Df(3R)85F6-8;85F12-86A2 + Dp(2;2)35B2\35D7 was made from 
T(2;3)G1 6; for simplicity referred to as Df(3R)segG16; kindly provided by John Roote and 
Michael Ashburner), and two small deficiencies generated in our lab (Df(3R)mr73 and 
Df(3R)mrO! ; Terman and Kolodkin, unpublished). One of the five lines which exhibited a 
stretched out wing phenotype and removes MICAL (Df(3R)swp2 MICAL ) was used to 
characterize the MICAL loss-of-function phenotype, and the extent of this deletion is 
indicated in Fig 6C. 
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Western blot analysis 

[0381] Late stage 16/17 Drosophila embryos were genotyped and homogenized in lysis 
buffer (1% NP-40, 50 mM Tris-HCl pH 8.0, 2 mM EDTA, protease inhibitor cocktail), run 
on 4% SDS-PAGE gel, and subjected to western blotting. Lysates from 5 wild-type 
embryos, 5 embryos carrying a duplication of the MICAL locus (Dp(3;3)M86D[+]2 (85D1- 
4;87A5; Bloomington Stock Center), and 5 Df(3R)swp2 A//C4i embryos were blotted with 
either MICAL polyclonal antisera (MICAL-CT) or as a loading control an enabled 
monoclonal antibody (IG6C10). Prominent bands are observed at 530 kD, 330 kD, 300 kD, 
200 kDa, and 125 kDa in wild type and at stronger intensity in MICAL duplication embryo; 
none of these bands are observed in Df(3R)swp2 MICAL embryos. 

Drosophila Transformation Constructs 

[0382] A MICAL rescue construct (UASMICAL) was physically assembled from isolated 
cDNAs and cloned into the pUAST vector for Drosophila germline transformation (Yu et 
al, 1998). Three independent transgenics were obtained. In addition to using the rescue 
construct to attempt to rescue the LOF and the genetic interaction phenotypes, this construct 
was used to examine the effects of overexpression of MICAL in all neurons. Expressing 
one copy of UASMICAL in all neurons in a wild-type background resulted in less penetrant 
phenotypes than expressing 2 copies . 

RESULTS 

[0383] To determine whether MICAL functions in vivo to propagate Sema- 1 a-mediated 
motor axon guidance, detailed genetic analyses were performed of MICAL gain- and loss- 
of-function mutants. A small, tractable, deficiency (called Df(3R)swp2MICAL; see 
Figure 6) was generated that removes ~170Kb that includes the MICAL locus and -six 
other genes. Western blot analysis on lysates from embryos homozygous for 
Df(3R)swp2MICAL demonstrates a loss of all MICAL protein (Figure 6D), and no MICAL 
immunostaining is observed in these embryos . These data, in combination with rescue 
experiments using a MICAL cDNA (see below), define the small deficiency 
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Df(3R)swp2MICAL as a MICAL null allele. Df(3R)swp2MICAL homozygotes survive 
into larval stages and have no overt morphological abnormalities (see Figure 6). 

[0384] If MICAL functions in Sema-la/PlexA-dependent repulsive axon guidance, then 
MICAL LOF mutants should exhibit motor axon guidance defects similar to the distinct and 
highly penetrant defects seen in Semala and PlexA LOF mutants. The development of the 
stereotypic pattern of neuromuscular connectivity in embryonic Drosophila abdominal 
segments is observed with anti-fasciclin II (mAb 1D4) staining of stage 16/17 embryos 
(VanVactor et al, 1993). Motor axons initially exit the CNS as part of either the 
intersegmental nerve (ISN) or the segmental nerve (SN). They are then guided into five 
major nerve branches (the ISN, ISNb, ISNd, SNa, and SNc), each of which targets different 
muscle groups such that individual motor axons eventually innervate individual target 
muscles (Landgraf et al., 1997). 

[0385] In wild-type embryos, the ISNb is formed by ISN axons defasciculating and 
extending dorsally through the ventral musculature to innervate muscles 6 and 7 and 
muscles 12 and 13. Axons within the ISNb pathway in Semala or PlexA mutants often fail 
to defasciculate and innervate their muscle targets (Table 1; Winberg et al., 1998b; Yu et al., 
1998). In the absence of MICAL, axons within the ISNb show similar highly penetrant 
ISNb phenotypes (Table 1). These phenotypes include the failure of some or all axons to 
defasciculate from the ISN, stalling of axons within the ISNb following defasciculation 
from the ISN, ISNb axons bypassing their target muscle groups, and greatly reduced or 
absent innervation of target muscles. 

[0386] Axons within the SNa pathway in MICAL mutants also exhibit highly penetrant 
defects similar to those observed in both Semala mutants and PlexA mutants. In wild-type 
embryos, SNa axons defasciculate from the SN and extend through the ventral musculature 
as a single tightly fasciculated bundle. At the dorsal edge of muscle 12, SNa axons 
defasciculate to give rise to a dorsal (D) and lateral (L) branch. Axons within the dorsal 
branch extend dorsally between muscles 22 and 23 and then make two characteristic turns, 
continuing further dorsally between muscles 23 and 24 . hi Semala and PlexA mutants, 
SNa axons within the dorsal branch often stall near muscle 12 and fail to reach the dorsal- 
most portion of their trajectory (Table 1; Winberg et al., 1998b; Yu et al., 1998). MICAL 
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mutants exhibit similar, highly penetrant, SNa stall phenotypes (Table 1). MICAL mutants 
also exhibit prominent guidance defects in axons that give rise to the ISNd, SNc, TN, and 
the third most lateral fasciclin II -positive CNS longitudinal connective-defects, which have 
been observed in Semala and PlexA mutants (Winberg et al, 1998b; Yu et al, 1998). In 
MICAL LOF mutant embryos additional phenotypes beyond those seen in PlexA and 
Semala mutants were not observed, suggesting that MICAL primarily functions during 
Drosophila neural development in PlexA signaling events. 

[0387] MICAL expression was restored in homozygous Df(3R)swp2 MICAL embryos 
using one copy of the transgenic construct UAS- MICAL under the control of the neuron- 
specific driver Elav-GAL4. Due to the large size of the MICAL protein, rescue was 
attempted using the smallest MICAL isoform— the 300kD "small" form (Figure 1 A). The 
level of neuronal MICAL expression observed by immunostaining with MICAL antibodies 
in three independent MICAL transformants was somewhat lower than that seen in wild type 
embryos . Neuronal MICAL expression did not rescue the adult lethality in Df(3R)swp2 

MICAL i 

homozygotes, suggesting a requirement for the MICAL "long" form, other genes 
within the Df(3R)swp2 MICAL deficiency, and/or MICAL in non-neuronal cells for adult 
viability. We did, however, observe that neuronal MICAL expression in homozygous 
Df(3R)swp2 MICAL embryos almost completely rescues embryonic ISNb and SNa motor 
axon guidance defects (Table 1) and CNS longitudinal connective defects . Therefore, axon 
guidance phenotypes observed in the MICAL deficiency Df(3R)swp2 MICAL result from a 
lack of neuronal MICAL. 
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Table 1. Axon guidance phenotypes (ISNb and SNa Phenotypes) 



Genotvpe (Tiernisegments) 


Abnormal 
ISNb 


Abnormal 
Muscle 6/7 
Innervation ( y ) 


Abnormal 
Muscle 12/13 
Innervation 


SNa 
PathwavW 












+/+(wild type) (n-120)) 


0% 


1.7% 


2.5% 




Elav-GAL4I± (n=130) 


0% (0%) 


1.5% (0%) 


4.6% 


8.5% (0%) 


Elav-GAL4/Elav-GAL4 (n=109) 


0% (0%) 


2.8% (0%) 


8.2% 


12.8% (0%) 


Df(3R)swp2 MICAL /+ (n=l 10) 


0% 


J.O /O 


/.2% 


7.3% 


Semala PI I+ fn=1 IfVi 


0% 


2.7% 


8.1% 


9.1% 


Df(4)C3 PlaA l+(n=\0<S) 


0% 


1.0% 


3.0% 


12.0% 


LOSS OF FUNCTION: 










Df(3R) SW p2 M!CAL /Df(3R)swp2 mCAL (n=103) 


1.0% 


68.9% 


57,3% 


81.2% 


SemaIa P! /Semala p ' (n=>97) 


5.2% 


47.4% 


77.3% 


85.7% 


Df(4)C3 p,acA /Df(4)C3 p,exA (n=148) 


4.1% 


60.8% 


47.3% 


74.3% 


Elav-GAL4IUASMICAL; 










Df(3R)swp2 M,CAL /Df(3R)swp2 mcAL (n=l38) 


0% 


5.8% 


13.0% 


20.3% 


UASMICAL G * w /+;Elav-GAL4l+; 










uj{DKjswpz IJJ]{5R)swp2 (11=137) 


0% 


67.2% 


68.6% 


69.7% 


OtilNJti I1C 1JN lJcJKACllONS: 










Semala P1 /+:Df{3R)swn2 MICAL /+ fn=1W) 


.5.0 /o 


32.7% 


37.3% 


51.8% 


Df(3R)swp2" ll ^/+;Df(4)C3™ exA /+ (n=105) 


0% 


41.9% 


33.0% 


44.6% 


Semala pl /+\Df(4)C3 PIexA /+ (n=108) 


0% 


32.4% 


39.8% 


68.5% 


Cjinty* 171-.-, /~< A T A /J - 

oemaia ,iilav-(jrAL4/-r; 










UASMICAL/Df(3R)swp2 MICAL fh-99) 


0% 


6.1% 


6.1% 


9.2% 


GAIN OF FUNCTION: 










Elav~GAL4/Elav-GAL4; 










UASMICAL Myr - cr /UASMICAL Myr - c \r^=97) 


2.1% (50%) 


32.0% (3.2%) 


48.5% 


46.4%(2.2%) 


UASMICAL/UASMICAL; 










Elav-GAL4/Elav-GAL4 (n=130) 


0.8% (100%) 


44.2% (59.6%) 


50.0% 


38.0% (71.4%) 


UASMICAL G * W /+; 










Elav-GAL4f+ (n=170) 


2.4% (100%) 


36.7% (43.5%) 


44.4% 


48.5% (37.8%) 



„__, r „.. ----- — — wlMWUlfll * " iC a^«ns siainng, oypassmg targets, absent or 

decreased muscle innervation; "failure to make the two characteristic turns between muscles 22 and 23 and muscles 23 and 24- x all ISNb 
axons follow the ISNd or SNc, or ISNb axons remain fasciculated with the ISN but ultimately wander in the lateral muscle fieldsor 
project back to innervate ventral muscles; "increased (long or excessively thick) muscle innervation, excessive branching, projecting into 
abnormal target fields; ^asciculation with the ISN, premature branching, following abnormal pathways, termination on wrong muscles 
(x,y,z indicate percent (%) of defects in a,b,and c, respectively). * 
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EXAMPLE 4 

MICAL GENETICALLY INTERACTS WITH SEMA1 A AND PLEXA 

[0388] This example illustrates that MICAL and PlexA function in the same signaling 
pathway to guide motor axons. 

RESULTS 

[0389] To address whether MICAL functions in the same signaling pathway with PlexA 
to mediate Sema- la-mediated repulsive axon guidance classical genetic interaction analysis 
was employed by asking whether heterozygosity at both MICAL and PlexA, or MICAL and 
Semala, resulted in phenotypes not observed in either heterozygote alone. MICAL, PlexA, 
or Semala heterozygous embryos show no motor axon guidance defects (Table 1). 
Embryos heterozygous for both Semala and PlexA (Semala/+;Df(4)C3P lexA /+) show highly 
penetrant axon guidance defects similar to those observed in homozygous Semala or PlexA 
mutants (Winberg et al., 1998b; Table 1). Embryos heterozygous for both MICAL and 
Semala, or heterozygous for both MICAL and PlexA, exhibit axon guidance phenotypes 
similar to those seen in Semala/+;Df(4)C3 plexA /+ embryos, and these are seen at 
approximately equal penetrance (Table 1). For example, the ISNb and SNa of Semala/+; 
Df(3R)swp2 MICAL /+ or Df(3R)swp2 MICAL /+;Df(4)C3 plexA /+ embryos exhibit guidance errors 
at specific choice points similar to those seen in homozygous PlexA, Semala, or MICAL 
mutant embryos. One copy each of both the UAS-MICAL and Elav-GAL4 transgenes was 
introduced into the Semala/+; Df(3R)swp2 MICAL /+ background and observed that neuronal 
MICAL expression rescues both the ISNb and SNa phenotypes in these transheterozygous 
embryos (Table 1). These results support the idea that MICAL and PlexA function in the 
same signaling pathway to guide motor axons. 
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EXAMPLE 5 

MICAL GAIN-OF-FUNCTION AXON GUIDANCE PHENOTYPES 

[0390] This example provides results that further establish that MICAL participates in 
PlexA-mediated motor neuron guidance, and illustrates that dominant negative MICAL 
mutants can be generated. 

[0391] The MICAL Myr_CT construct was constructed by PGR amplification of the plexin 
interacting region of MICAL with PCR primers containing a myristoylation sequence (base 
pairs corresponding to the first 14 amino acids of Drosophila src; Simon et al, 1985), cloned 
into the pUAST vector, sequenced on both strands, and one Drosophila transgenic was 
obtained as described above. Embryos expressing 1 copy of UASMICAL Myr CT in all 
neurons using the GAL4-UAS system {Elav-GAL4) exhibited phenotypes less penetrant 
then when 2 copies were expressed . 

RESULTS 

[0392] To complement MICAL LOF analysis, it was determined whether MICAL GOF 
mutants exhibit motor axon guidance phenotypes similar to those observed in PlexA GOF 
mutants (Winberg et al. ? 1998b). MICAL was overexpressed in all neurons in a wild type 
background using the GAL4-UAS system and our rescue construct. Neuronal 
overexpression using one or two copies of our MICAL rescue construct leads to highly 
penetrant motor axon guidance phenotypes (Table 1). GOF phenotypes resulting from one 
copy of the MICAL rescue construct in a wild type background can be suppressed in a 
Df(3R)swp2 MICAL genetic background (Table 1). These defects in some cases are quite 
similar to the defects observed in MICAL mutants and defects reported in PlexA GOF 
mutants (Winberg et al., 1998b). However, a large fraction of these MICAL GOF motor 
axon guidance phenotypes are consistent with increased defasciculation (Table 1), as 
similarly described for the PlexA GOF mutants. For example, ISNb axons were often seen 
to abnormally leave the ISNb and project incorrectly within the ventral musculature 
(Table 1). Likewise, some SNa axons defasciculated at incorrect locations and projected to 
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inappropriate areas (Table 1). Therefore, MICAL GOF mutants exhibit phenotypes similar 
to PlexA GOF mutants, again suggesting that MICAL participates in PlexA-mediated motor 
axon guidance. 

[0393] Additional support for MICAL's role in PlexA signaling was obtained by 
expressing in all neurons a truncated MICAL protein consisting only of the MICAL PlexA- 
interacting region (Figure IB). This protein was targeted to the cell membrane by 
introducing an N-terminal myristoylation sequence (MICALMyr-CT) and found that 
neuronally expressed MICALMyr-CT acts in a dominant-negative fashion, resulting in axon 
guidance phenotypes similar to those observed in MICAL mutants (Table 1). Prominent 
GOF phenotypes like those resulting from MICAL or PlexA overexpression were not 
observed, indicating that neuronal MICALMyr-CT is likely occluding normal MICAL- 
PlexA associations and therefore MICAL signaling. This also suggests that the MICAL 
protein contains domains distinct from the PlexA-interacting domain that function to 
regulate axonal guidance. 



EXAMPLE 6 

THE MICALS ARE A FAMILY OF NEURONALLY EXPRESSED, PLEXIN- 
INTERACTING PROTEINS CONSERVED FROM FLIES TO MAMMALS 

[0394] This example illustrates that MICAL proteins have conserved protein domains 
with identical organization in all family members. 

RESULTS 

[0395] MICAL proteins have conserved protein domains with identical organization ii 
all family members and a high degree of amino acid identity among these domains in 
different MICALs (Figure 2A). Suzuki et al. (2002) identified MICAL- 1 and a partial 
sequence of MICAL-2. One MICAL was identified in Drosophila and three mammalian 
MICALs were identified. The MICALs appear unique with respect to containing both 
calponin homology (CH) and LIM domains, in addition to their conserved N- and C- 
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terminal regions. A family of MICAL-like (MICAL-L) proteins were also identified, 
members of which have a similar organization to MICALs but lack the region N-terminal to 
the CH domain (Figure 2B). There is one MICAL-L protein in Drosophila (D-MICAL-L) 
and at least two family members in humans. D-MICAL-L cDNA and genomic DNA 
sequence information suggest that D-MICAL-L begins just N-terminal to the CH domain. 
Analysis of publicly available mammalian cDNA and genomic sequences suggests that 
human MICAL-L 1 and MICAL-L2 are similar in overall domain organization to D- 
MICAL-L and do not contain the highly conserved -500 amino acid MICAL N-terminal 
domain. 

[0396] To address whether the function of MICALs is conserved in vertebrates, 
expression patterns and interactions with plexins were analyzed. It was found that the 
mRNA of all three rat MICALs shows specific neuronal and non-neuronal expression 
during development. For example, MICAL 1, MICAL2, and MICAL3 are expressed in the 
rat spinal cord, dorsal root ganglia (DRG), and sympathetic ganglia at embryonic days 15 
(E15) and El 8 in patterns which appear overlapping but distinct. Interestingly, the neuronal 
expression patterns of individual MICALs are similar to those observed for several plexins, 
as can be seen for PlexA3 and MICAL 1 . In addition, results presented herein indicate that 
the plexin interacting domains of human MICAL- 1 and mouse MICAL-2 specifically 
interact with the C2 domains of human PlexA3 and mouse PlexA4, respectively, and do so 
as strongly as the autologous domains of Drosophila MICAL and PlexA (see Figure 5B). 



EXAMPLE 7 

MICALS CONTAIN AN N-TERMINAL FLAVOPROTEIN 
MONOOXYGENASE DOMAIN 



[0397] This example illustrates that MICAL proteins include a highly conserved an N- 
terminal monooxygenases domain. 
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MICAL Flavoprotein Monooxygenase Fusion Protein 
Purification and FAD Binding 

[0398] A His-tagged bacterial fusion protein was constructed that included the MICAL 
flavoprotein monooxygenase domain (MICAL FM) by inserting amino acids 1-526 of 
Drosophila MICAL into the bacterial expression vector pET 43.1b containing a 
hexahistidine tag (Novagen). The plasmid was transformed into E. Coli BL21 (DE3) and the 
hexahistidine tagged recombinant protein was expressed by IPTG induction and MICAL 
FM was isolated with the inclusion bodies, denatured with 6M GdmHCl (Gibco) and 
purified under denaturing conditions over a Ni 2+ column (Novagen). MICAL FM was 
renatured at 25°C for 3 hours by diluting the purified protein 100X into a solution 
containing a 5-fold molar excess (to MICAL FM present in the solution) of free FAD 
(Sigma), lOmM DTT, and 10]ng/mL BSA as described (Lindsay et al., 2000). Renatured 
protein was dialyzed for 48 hours at 4°C into 5 changes of His binding buffer (5mM 
imidazole, 0.5 M NaCl, 20mM Tris-HCl, pH 7.9) to remove free FAD, and DTT. Ni 2 + 
purification beads were then incubated with the MICAL FM sample in batch for 5 hours, at 
4°C. The solution was then repurified through a Ni 2+ column. Fractions containing MICAL 
FM were pooled and subjected to dialysis into a more stable buffer containing 5mM DTT, 
and then subjected to Coomassie staining and Western analysis to confirm the purity of the 
sample. Spectral analysis was done using a Perkin-Elmer UV/VTS Lambda- 12 
spectrophotometer scanning from 300 to 550nm. 

RESULTS 

[0399] The high degree of conservation of the MICAL N-terminus among family 
members (up to 62% identical between flies and humans; Figure 2A) suggests that this 
domain is functionally important. Upon closer examination of this conserved region, we 
noted a consensus dinucleotide binding sequence, GXGXXG (Figures IB and 3 A), which is 
distinct from the sequence present in classical mononucleotide binding motifs (Eggink et 
al., 1990; Eppink et al., 1997; Schulz, 1992; Wierenga et al., 1986). Further, the amino acid 
sequence in this 500 amino acid region reveals that MICALs contain three separate 
sequence motifs spaced throughout this domain that define them as flavoprotein 
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monooxygenases (also called hydroxylases), a subclass of oxidoreductases (Eggink et aL, 
1990; Eppink et aL, 1997; Wierenga et aL, 1986). The amino acid sequence surrounding the 
GXGXXG motif matches perfectly the consensus sequence for the ADP binding region of 
flavin adenine dinucleotide (FAD) binding proteins (Rossmann fold or FAD Fingerprint 1, 
Figures IB and 3 A), and distinguishes this region from consensus NAD, or NADP binding 
folds (Vallon, 2000; Wierenga et aL, 1986). MICALs also have a well-conserved GD motif 
(FAD Fingerprint 2; Figures IB and 3 A) C -terminal to the FAD Fingerprint 1 region, which 
is important for binding the ribose moiety of FAD (Eggink et al., 1990; Eppink et aL, 1997). 
Finally, MICALs have the conserved DG motif ("Conserved Motif; Figures IB and 3A) 
between the FAD Fingerprint 1 and 2 motifs that has been reported to be involved in 
binding the pyrophospate moiety of FAD (Eppink et aL, 1997). Proteins with these 
consensus FAD binding regions bind FAD and use FAD in the catalysis of oxidation- 
reduction reactions. Flavoprotein monooxygenases are oxidoreductases (enzymes that 
catalyze oxidation and reduction reactions) and catalyze the insertion of one atom of 
molecular oxygen into their substrate using nucleotides as electron donors (Massey, 1995). 
These monooxygenases are also defined by their use of FAD as a co-enzyme. Apart from 
these three consensus regions, monooxygenases vary significantly, reflecting the wide range 
of enzymes in this family and their variable substrate binding pockets also encompassed 
within this domain (Eppink et aL, 1997). However, MICALs and other monooxygenases 
show significant similarity within these three FAD binding regions and also similar spacing 
of these regions within the monooxygenase domain. 

[0400] Does MICAL bind FAD? A solution of the purified MICAL-flavoprotein 
monooxygenase (FM) domain (expressed in bacteria) is yellow in color, a characteristic of 
flavoproteins. Spectral analysis of purified MICAL-FM shows that it has an absorption 
peak at 452nm and a shoulder at ~358nm (Figure 3B). This is similar to the absorption 
spectra of FAD itself (~450nm and ~360nm; Macheroux, 1999), and to other related 
flavoproteins (e.g., p-Hydroxybenzoate Hydroxylase, Hosokawa and Stanier, 1966; and 
GidA, White et aL, 2001), suggesting that MICAL-FM binds FAD. These results, in 
combination with the sequence homology, raises the possibility that MICAL enzymatic 
activity within the N-terminal conserved domain serves an integral function in plexin 
signaling. 
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EXAMPLE 8 

AN INTACT FAD BINDING MOTIF IS REQUIRED FOR MICAL 
MOTOR AXON GUIDANCE FUNCTIONS 

[0401] This example illustrates that an intact flavoprotein monooxygenases domain is 
necessary for MICAL function in repulsive motor axon guidance. 

[0402] To make the MICAL G * W mutant (SEQ ID NO:20), the dinucleotide binding 
region of Drosophila MICAL was mutated from GXGXXG to WXWXXW, such that the 
glycines were changed to tryptophans. Oligonucleotides containing terminal endogenous 
restriction sites (Mlu I) and base pair substitutions (GGA GCA GGG CCC TGT GGA (SEQ 
ID NO:37) changed to TGG GCA TGG CCC TGT TGG (SEQ ID NO:38)) were used to 
amplify a 1.4kb fragment that was cloned in the correct orientation into the full length 
MICAL rescue construct. The region was sequenced on both stands and substitutions also 
disrupted a restriction site (Apa I) so the mutated construct could also be confirmed by 
restriction analysis. One transgenic, located on the X chromosome, was obtained. 

RESULTS 

[0403] The glycine residues within the GXGXXG motif of FAD binding proteins are 
essential for allowing the FAD binding and enzymatic activity (Wierenga et al, 1986; Dym 
and Eisenberg, 2001). To test the necessity of MICAL FAD binding, and potential 
enzymatic activity, for plexin-mediated repulsive axon guidance the three glycine residues 
within the FAD Fingerprint 1 motif of MICAL were mutated to tryptophan 
(GAGPCGL (SEQ ID NO : 3 9)-* WAWPC WL (SEQ ID NO:40): mutations which in 
related flavin-containing monooxygenases disrupt FAD binding but do not alter the overall 
structure of the protein (Kubo et al., 1997; Lawton and Philpot, 1993; Wierenga et al., 
1986). The resulting construct, MICAL 0 * w , was used for in vivo neuronal expression in 
Drosophila. Transgenic flies containing the UAS -MIC AL G_ * w transgene were generated 
and immunohistochemical and Western analyses confirmed that MICAL G " >W was expressed 
at levels comparable to those of our wild-type "short" MICAL variant that was used to 
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rescue MICAL mutant motor axon guidance phenotypes . Unlike neuronal expression of 
MICAL in a homozygous Df(3R)swp2MICAL mutant background, which rescues all ISNb 
and SNa defects, one copy of the neuronal MICAL G * W rescues none of these defects (Table 
1). This strongly suggests that activity of the MICAL monooxygenase domain is necessary 
for normal MICAL function. 

[0404] Since MICAL G * W contains an intact plexin interacting domain but is functionally 
inactive, we predicted that it would exert a dominant-negative effect on motor axon 
projections in a wild-type genetic background, binding to PlexA but blocking signaling in a 
manner similar to the MICALMyr-CT construct. When one copy of the MICAL G * W 
reporter construct was used to express MICAL G * W in all neurons in a wild-type genetic 
background we observed highly penetrant ISNb, SNa, and CNS longitudinal connective 
defects (Table 1) providing further evidence that MICAL G "* W is being expressed neuronally 
and is likely able to bind PlexA. However, though many of these defects resemble 
phenotypes observed in Semala, PlexA, or MICAL LOF mutants (Table 1), a significant 
fraction (ISNb: >44%, SNa: 38%) were strikingly distinct (Table 1). For example, though it 
was observed that ISNb and SNa axon guidance phenotypes consistent with MICAL LOF 
phenotypes (Table 1), these phenotypes were often more severe. They include defects in 
which axons bypass their muscle targets but then appear to defasciculate in inappropriate 
places and project into adjacent segments. Interestingly, also observed were ISNb and SNa 
axon guidance phenotypes consistent with MICAL GOF, but these phenotypes also 
appeared more severe and included, severely defasciculated and tangled axons. Finally, 
phenotypes were observed that were unlike MICAL or PlexA LOF or GOF mutants, 
including axons projecting along the entire length of the muscle 6/7 cleft and dramatic 
axonal wandering within muscle fields. These phenotypes suggest that expression of 
MICAL G_ * W leads to defects not explained by a simple elevation or diminution of PlexA 
signaling activity. 
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[0405] In summary, these results show the necessity of an intact flavoprotein 
monooxygenase domain for MICAL function in repulsive motor axon guidance. 

EXAMPLE 9 

FLAVOPROTEIN MONOOXYGENASE INHIBITORS NEUTRALIZE 
VERTEBRATE SEMA3 A AXONAL REPULSION 

[0406] This example identifies the gallic acid derivatives EGCG and EC as inhibitors of 
semaphorin-mediated axon repulsion. 

Vertebrate In vitro Repulsion and Collapse Assays 

[0407] DRG repulsion assays were performed as described (Messersmith et aL, 1995). 
(EGCG), (EC), L-NAME, allopurinol, (Sigma) were dissolved in vehicle (PBS), protected 
from light, and then added to the culture media to final concentrations. Rotenone (Sigma) 
was dissolved in 95% EtOH and then added to the culture media (final EtOH concentration 
was below 0.1% and had no effect on axon outgrowth). 

[0408] Inhibitors specific for nitric oxide synthase (N-nitro-L-arginine methylester (L- 
NAME); Comoletti et aL, 2001), xanthine oxidase (allopurinol (Alio); Jonakait et aL, 2000), 
or mitochondrial electron transport (NADH dehydrogenase; rotenone (Rote); Frantseva et 
aL, 2001) were used at concentrations previously shown to be effective in cell culture 
conditions (see Figure 7). The effect of EGCG on growth cone collapse was performed by 
culturing DRGs using standards techniques (Fan et al, 1993). After 48 hours in culture, 
DRGs were incubated in media containing EGCG or vehicle (PBS) for 3 hours prior to a 1 
hour application of InM AP-Sema3A or non-AP-Sema3A containing growth media. 
Scoring was done by splitting explants into quadrants and scoring all growth cones as either 
collapsed or not-collapsed. 

RESULTS 

[0409] The MIC ALs may be susceptible to small molecule inhibitors that affect their 
ability to oxidize their substrate. Some gallic acid derivatives, including the green tea 
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component (-)-epigallocatechin gallate (EGCG), are potent and selective inhibitors of two 
flavoproteinmonooxygenases: squalene epoxidase (SE) and p-hydroxybenzoate 
hydroxylase (pHBH) (Abe et al., 2000a; Abe et al., 2000b). 

[0410] All available evidence points to the plexin cytoplasmic domain as an essential 
signal transducing domain for signaling class 3 semaphorin repulsion (Cheng et al., 2001; 
Takahashi and Strittmatter, 2001). Sema3A appears to utilize neuropilin-1 in combination 
with A class plexins to signal repulsive guidance. To ask whether selective flavoprotein 
monooxygenase inhibitors can neutralize semaphorin-mediated repulsion in vertebrates, in 
vitro rat DRG growth cone repulsion assays were employed using Sema 3 A-secreting 293 
cells (Figure 7A; Messersmith et al., 1995). NGF-dependent DRG axons exhibit little to no 
outgrowth towards Sema3 A-secreting 293 cell aggregates (Figures 4C). However, when 25 
mM EGCG is added to the growth media Sema3 A repulsion was completely neutralized 
(Figures 4C). EGCG attenuation of Sema3A-mediated repulsion is dose-dependent (Figure 
4C). We also asked whether (-)-epicatechin (EC), a compound structurally similar to 
EGCG but a poor inhibitor of SE (Abe et al., 2000b), had a similar effect on Sema-3A- 
mediated repulsion. Like EGCG, EC was capable of completely neutralizing Sema-3A- 
dependent repulsion in a dose-dependent manner, but a much higher EC concentration was 
required (Figures 4C). To address the possibility that a general inhibition of oxidation- 
reduction mechanisms by these reagents underlies this attenuation of Sema3A repulsion, 
selective inhibitors of other redox enzymes present in neurons were analyzed for an effect 
on Sema3A-mediated repulsion. No attenuation of Sema 3 A mediated axonal repulsion was 
observed using inhibitors specific for nitric oxide synthase (N-nitro-L-arginine methylester 
(L-NAME)), xanthine oxidase (allopurinol (Alio), or mitochondrial electron transport 
(NADH dehydrogenase; rotenone (Rote), at concentrations previously shown to be effective 
in cell culture conditions (Figure 4C). DRG axons and Sema3A-secreting 293 cells 
appeared normal following growth in the presence of all but one of these inhibitors. In 
some explants we noticed an adverse effect on survival of DRGs treated with rotenone, but 
axons in those rotenone-treated explants that survived, although somewhat thinner than 
normal, were robustly repelled (Figure 4C). The amount and biological activity of Sema3A 
produced by 293 cells in the presence of all inhibitors was similar as assessed using a DRG 
growth cone collapse assay (Figure 4B), showing that none of these inhibitors had an 
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adverse effect on the ability of 293 cells to produce active Sema 3A. It was also determined 
in separate experiments that 25mM EGCG dramatically abrogates Sema3 A-mediated 
growth cone collapse in NGF-dependent DRG neurons. Taken together, our results support 
a role for flavoenzymes and oxidation-reduction mechanisms in semaphorin-mediated axon 
guidance. 

EXAMPLE 10 

FURTHER CONSIDERATIONS REGARDING THE ASSOCIATION OF MICALS 
AND SEMAPHORIN-MEDIATED AXONAL GUIDANCE 

[041 1] This example provides further insight into the association of MICALs and 
semaphorin-mediated axonal guidance. 

[0412] Neuronal growth cone guidance depends on the ability of various guidance cue 
receptors to regulate cytoskeletal dynamics in response to the local presentation of ligands. 
It was shown herein that proteins belonging to the MICAL family of cytosolic, multi- 
domain, flavoprotein monooxygenases are required for certain plexin-mediated semaphorin 
axon guidance events. MICALs associate with plexins and contain several conserved 
domains that provide the potential for interactions with both growth cone cytoskeletal 
components and many signaling proteins intimately involved in their regulation. Our 
results suggest that MICALs directly participate in plexin signaling through the action of 
their flavoprotein monooxygenase domain. These observations provide a framework for 
dissecting the molecular basis of semaphorin-meditated neuronal guidance and also a 
potential target for attenuating their repulsive action. 

[0413] Genetic and biochemical results provided herein support an essential role for 
Drosophila MICAL in mediating PlexA/Sema-la repulsive guidance events required for 
motor axon pathfinding. Future experiments will establish whether MICAL mediates PlexB 
signaling, and if so, whether this occurs directly or indirectly. Drosophila MICAL is an 
orthologue of a mammalian MICAL-1 protein. It is shown herein that that there are at least 
three vertebrate MICAL orthologues (MICALs 1, 2, and 3). We also identify here a family 
of MICAL-like proteins that lack the conserved N-terminal MICAL monooxygenase 
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domain. Expression and interaction data herein support the idea that MICALs mediate 
plexin signaling in vertebrates. In addition, flavoprotein monooxygenase inhibitors block 
Sema3A-mediated repulsion and collapse of NGF-dependent DRG axons — repulsive 
interactions dependent on A class plexins including Plexin A3. Future genetic and 
biochemical analysis will establish the role of vertebrate MICALs in neuronal and non- 
neuronal plexin signaling. 

[0414] The highly conserved -500 amino acid N-terminal MICAL domain contains 
signature amino acid sequences of the flavoprotein monooxygenase family of 
oxidoreductases. Biochemical and genetic analyses herein strongly suggest that MICALs 
contain functional FAD binding monooxygenase domains required for mediating plexin 
signaling. In support of this idea, it was observed that inhibition of flavoprotein 
monooxygenase enzymatic activity dramatically attenuates semaphorin-mediated axon 
repulsion and growth cone collapse. However, though the inhibitors we used, ECGC and 
EC, have a high degree of selectivity for flavoprotein monooxygenases, similar 
concentrations of EGCG inhibit other enzymes including steroid 5a-reductase, NADPH- 
cytochrome P450 reductase, telomerase, matrix metalloproteinases MMP-2 and MMP-9, 
and phenol sulfotransferase (Abe et al., 2000a; Abe et al., 2000b). Although most of these 
enzymes are unlikely to be expressed in the growth cones of DRG axons, potential non- 
specific effects of these inhibitors cannot be ruled out despite their demonstrated selectivity 
for monooxygenases. Taken together with our in vivo Drosophila experiments showing a 
requirement for the MICAL FAD binding region in Sema-la mediated axon repulsion, these 
data suggest redox signaling plays an important role in vertebrate semaphorin-mediated 
axonal repulsion. 

[0415] Flavoprotein monooxygenases specifically catalyze the oxidation of a number of 
substrates, and in some contexts they can function as oxidases and generate reactive oxygen 
species (Massey, 1994). Results herein suggest that MICALs are flavoproteins most similar 
to the flavoprotein monooxygenase family of oxidoreductases, but a complete 
understanding of the chemical nature of the reactions catalyzed by MICALs awaits future 
study and identification of substrates. The redox regulation of amino acid residues within 
signaling proteins (including kinases, phosphatases, small GTPases, guanylate cyclases, and 
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adapter proteins) and cytoskeletal proteins (including actin, actin binding proteins, 
intermediate filament proteins, and GAP-43) has been shown to modulate their function 
(Finkel, 1998; Kim et aL, 2002; Meng et aL, 2002; Rhee et aL, 2000; Stamler et aL, 2001; 
Thannickal and Fanburg, 2000). In addition, oxidation of actin leads to disassembly of actin 
filaments, instability and collapse of actin networks, reduced ability of actin to interact with 
actin cross-linking proteins, and a decrease in the ability of actin monomers to form 
polymers (Dalle-Donne et aL, 2001a; Dalle-Donne et aL, 2001b; Milzani et aL, 1997). 
Finally, it is also interesting that MICALs have a putative actin filament binding domain 
(CH domain) and that MICAL-1 interacts with vimentin, an intermediate filament protein 
(Suzuki et aL, 2002). 

[0416] It was recently reported that the proline rich region of vertebrate MICAL-1 
interacts with the SH3 domain of the adaptor protein CasL (HEF1) in non-neuronal cells 
(Suzuki et aL, 2002). CasL, along with the related proteins pl30Cas, and Efs (Sin), make 
up the Cas family of proteins (OTSTeill et aL, 2000), which assemble and transduce 
intracellular signals that stimulate cell migration and axon outgrowth. These proteins have 
numerous protein-protein interaction domains, including a Src-homology 3 (SH3) domain, 
multiple SH2-binding sites in their substrate domain, several proline-rich motifs, and a C- 
terminal dimerization module. This structure suggests a role for Cas family proteins as 
docking molecules, and numerous interacting proteins have been identified, including 
kinases (e.g. FAK, Src, and Abl), phosphatases (e.g. PTP-1B, and SHP2), GEFs (e.g. C3G), 
and adaptor proteins (e.g., Nek, Crk, Grb2, and 14-3-3) (O'Neill et aL, 2000). Studies 
indicate that Cas proteins localize mainly to focal adhesions and stress fibers, and that they 
are required in integrin-dependent cell migration and actin filament assembly. Cas proteins, 
therefore, may play an important role in plexin-mediated repulsive and attractive guidance 
events. 

[0417] In conclusion, characterized herein is a gene family conserved from invertebrates 
to vertebrates, with proteins whose structure and function strongly suggest that redox 
signaling is important for semaphorin-mediated axonal repulsion. The results herein also 
suggest that protein oxidation could be a general means for inhibiting axonal growth. Given 
the presence of high amounts of reactive oxygen species and other oxidants in the spinal 



WO 03/066821 



PCT/US03/03551 



135 

cord after injury (Juurlink and Paterson, 1998) regulation of redox signaling using 
antioxidants and specific enzyme inhibitors may be a powerful approach for encouraging 
axonal regeneration. 
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What is claimed is: 

1. An isolated polypeptide comprising an N-terminal MICAL domain, a calponin 
homology domain, a LIM domain, a proline rich region, and a plexin interacting region, 
wherein the polypeptide has monooxygenase activity. 

2. An isolated polypeptide of claim 1, wherein the polypeptide is a mammalian 
MICAL polypeptide. 

3. An isolated polypeptide of claim 2, wherein the isolated polypeptide is human 
MICAL- 1, human MICAL-2, or human MICAL-3. 

4. An isolated polypeptide of claim 3, wherein the polypeptide comprises an amino 
acid sequence as set forth in SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6. 

5. An isolated polypeptide of claim 1, wherein the polypeptide comprises an 
N-terminal MICAL domain having at least about 50% sequence identity to the N-terminal 
amino acids 1-500 of SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6. 

6. An isolated polypeptide of claim 1 , wherein the polypeptide is a Drosophila 
MICAL polypeptide. 

7. An isolated polypeptide of claim 6, wherein the polypeptide is set forth in 
SEQ ID NO:8. 

8. An isolated polypeptide of claim 1, wherein the polypeptide is a MICAL isoform. 

9. An isolated polypeptide of claim 1, wherein the isolated polypeptide is at least 
90% identical to SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, 

SEQ ID NO: 10, or SEQ ID NO: 12. 
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10. An isolated polypeptide of claim 1, wherein the polypeptide comprises from N- 
terminal to C-terminal, an N-terminal MICAL domain, a calponin homology domain, a first 
variable MICAL region, a LEVI domain, a proline rich region, and a plexin interacting 
region. 

11. An isolated polypeptide comprising a plexin interacting region at least 90% 
identical to a plexin interacting region of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO: 10, or SEQ ID NO: 12, wherein the polypeptide has plexin 
interacting activity. 

12. An isolated polypeptide of claim 11, wherein the polypeptide comprises a plexin 
interacting region of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, 

SEQ ID NO:10, or SEQ ID NO:12, or a conservative variant thereof. 

13. An isolated polypeptide of claim 11, with the proviso that the polypeptide does 
not have monooxygenase activity. 

14. An isolated polypeptide of claim 11, wherein the polypeptide comprises the 
plexin interacting region of Drosophila MICAL-like polypeptide, or the plexin interacting 
region of human MICAL-like polypeptide 1, or 2, or a conservative variant thereof. 

15. An isolated polypeptide of claim 14, wherein the polypeptide has the amino acid 
sequence of SEQ ID NO: 14, SEQ ID NO: 16, or SEQ ID NO: 18. 

16. An isolated polypeptide comprising an N-terminal MICAL domain of 
Drosophila MICAL 1, or the N-terminal MICAL domain of human MICAL 1, 2, or 3, or a 
conservative variant thereof. 

17. An isolated polypeptide of claim 16, wherein the polypeptide has 
monooxygenase activity. 
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18. An isolated polypeptide comprising a calponin homology domain at least 90% 
identical to the calponin homology domain of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6 3 
SEQ ID NO:8, SEQ ID NO:10, or SEQ ID NO: 12, and wherein the polypeptide is involved 
in actin filament binding. 

19. An isolated polypeptide comprising a LIM domain at least 90% identical to the 
LIM domain of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, 

SEQ ID NO: 10, or SEQ ID NO: 12, and wherein the polypeptide specifically interacts with a 
LIM-binding protein. 

20. An isolated polypeptide comprising a proline rich region at least 90% identical 
to the proline rich region of SEQ ED NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, 
SEQ ID NO: 10, or SEQ ID NO:12, and wherein the polypeptide interacts with a 
polypeptide comprising an SH3-domain 

21 . An isolated polynucleotide comprising a nucleotide sequence encoding a 
polypeptide of claim 1. 

22. An isolated polynucleotide of claim 21, wherein the polynucleotide encodes a 
mammalian MICAL polypeptide. 

23. An isolated polynucleotide of claim 22, wherein the polynucleotide encodes a 
polypeptide that is at least 90% identical to SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6. 

24. An isolated polynucleotide of claim 23, wherein the polynucleotide encodes a 
polypeptide as set forth in SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6. 

25. An isolated polynucleotide of claim 21, wherein the polynucleotide encodes a 
MICAL polypeptide comprising an N-terminal MICAL domain having monooxygenase 
activity, and at least 50% sequence identity to the N-terminal 500 amino acids of human 
MICAL 1 polypeptide. 
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26. An isolated polynucleotide of claim 21, wherein the polynucleotide encodes a 
Drosophila MICAL polypeptide. 

27. An isolated polynucleotide of claim 26, wherein the polynucleotide encodes an 
amino acid sequence as set forth in SEQ ID NO:8, SEQ ID NO:10, or SEQ ID NO:12. 

28. A vector comprising a polynucleotide of claim 15. 

29. A vector of claim 28, wherein the vector is a recombinant expression vector. 

30. A vector of claim 28, wherein the vector is a viral vector. 

31. A host cell comprising a polynucleotide encoding the polypeptide of claim 1 
operably linked to a heterologous promoter. 

32. A host cell comprising a vector of claim 28. 

33. The host cell of claim 32, wherein the host cell is a stem cell. 

34. The host cell of claim 32, wherein the host cell is a neuronal lineage cell. 

35. An antibody or antigen binding fragment thereof that specifically binds the 
polypeptide of claim 1. 

36. The antibody or antigen binding fragment thereof of claim 35 which is human 
or humanized. 

37. The antibody or antigen binding fragment thereof of claim 35 which is an 
intrabody. 
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38. The antibody or antigen binding fragment thereof of claim 35 which specifically 
binds the N-terminal MICAL domain of said polypeptide. 

39. The antibody or antigen binding fragment thereof of claim 35 which specifically 
binds plexin interacting region of said polypeptide. 

40. A method for identifying an agent that affects axonal guidance regulatory 
activity, comprising contacting an isolated polypeptide of claim 1, or a cell recombinantly 
expressing a polypeptide of claim 1, with a candidate agent, and comparing the axonal 
guidance regulatory activity in the presence and absence of the agent, wherein a difference 
in activity is indicative of an agent that affects axonal guidance regulatory activity. 

41. The method of claim 40, wherein the agent inhibits axonal guidance regulatory 
activity. 

42. The method of claim 40, wherein the agent is a small molecule, an antisense 
polynucleotide, a MICAL-like polypeptide or fragment thereof, a mutant MICAL 
polypeptide, an anti-MICAL antibody, a double stranded RNA, or a peptidomimetic. 

43. The method of claim 40, wherein the agent is a monooxygenase inhibitor. 

44. The method of claim 43, wherein the anti-oxidant is a flavonoid. 

45. The method of claim 44, wherein the flavonoid is a gallic acid derivative. 

46. A method for screening for an agent that modulates an activity of a MICAL 
polypeptide, said method comprising (a) contacting the isolated polypeptide of claim 1 with 
a candidate agent and (b) comparing said activity of the polypeptide of claim 1 in the 
presence or absence of said candidate agent, wherein a difference in said activity indicates 
that the agent modulates the activity of the MICAL polypeptide. 
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47. The method of claim 46, wherein said activity is monooxygenase activity. 

48. The method of claim 46, wherein said activity is plexin A-binding activity. 

49. The method of claim 46, wherein the method is a cell-free assay. 

50. A method for screening for an agent that modulates an activity of a MICAL 
polypeptide, said method comprising (a) contacting a cell expressing the polypeptide of 
claim 1 with a candidate agent and (b) comparing said activity of the polypeptide of claim 1 
in the presence or absence of said candidate agent, wherein a difference in said activity 
indicates that the agent modulates the activity of the MICAL polypeptide. 

51. The method of claim 50, wherein the activity is monooxygenase activity. 

52. The method of claim 50, wherein the activity is plexin A-binding activity. 

53. The method of claim 50, wherein the cell is a neuron. 

54. The method of claim 50, wherein the cell is an immune cell. 

55. The method of claim 50, wherein the cell has a transformed phenotype. 

56. The method of claim 50, wherein the cell is a cardiac cell. 

57. A method for screening for an agent that modulates an activity of a MICAL 
polypeptide, said method comprising (a) contacting a cell that recombinantly expresses the 
polypeptide of claim 1 with a candidate agent and (b) comparing a phenotypic or 
physiological trait of said cell in the presence or absence of said candidate agent, wherein a 
difference in said phenotypic or physiological trait indicates that the agent modulates the 
activity of the MICAL polypeptide. 
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58. The method of claim 57, wherein the phenotypic or physiological trait involves 
dynamics of the cytoskeleton. 

59. The method of claim 57, wherein the phenotypic or physiological trait is axon 
guidance. 

60. The method of claim 57, wherein the phenotypic or physiological trait is cell 
proliferation or invasiveness. 

61 . The method of claim 57, wherein the phenotypic or physiological trait is an 
immune response. 

62. A method for screening for an agent that modulates the expression of a MICAL 
polypeptide, the method comprising (a) contacting a cell with a candidate agent; and (b) 
comparing the expression of the polypeptide of claim 1 in the presence or absence of the 
candidate agent, wherein a difference in the expression indicates that the agent modulates 
the expression of the MICAL polypeptide. 

63. The method of claim 62, wherein the level of mRNA encoding MICAL is 
compared. 

64. The method of claim 62, wherein the level of the MICAL polypeptide is 
compared. 

65. A polynucleotide that specifically hybridizes to a polynucleotide of claim 15, 
wherein the polynucleotide is at least 15 nucleotides in length. 

66. A polynucleotide of claim 65, wherein the polynucleotide inhibits expression of 
a polynucleotide that encodes a polypeptide of claim 1. 
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67. A polynucleotide of claim 65, wherein the polynucleotide is at least 90% 
identical to a complementary polynucleotide of a polynucleotide encoding a polypeptide of 
SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, or 
SEQIDNO:12. 

68. A polynucleotide of claim 65, wherein the polynucleotide specifically hybridizes 
to a polynucleotide encoding a polypeptide of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO:10, or SEQ ED NO:12. 

69. A double-stranded RNA molecule comprising a first RNA strand that 
specifically hybridizes to an mRNA encoding a MICAL polypeptide and a second RNA 
strand that is the reverse complement of said first strand, wherein said double-stranded RNA 
molecule is at least 15 base pairs in length. 

70. An isolated polypeptide or a functional peptide portion thereof, comprising a 
calponin homology domain, a LEVI domain, a proline rich region, and a plexin interacting 
region, and having plexin-interacting activity. 

71. An isolated polypeptide of claim 70, wherein the polypeptide comprises a 
calponin homology domain, followed by a first variable region, followed by a LIM domain, 
followed by a proline rich region, and followed by a plexin interacting region. 

72. An isolated polypeptide of claim 70, wherein the polypeptide is at least 90% 
identical to an amino acid sequence as set forth in SEQ ID NO:14, SEQ ID NO:16, or 
SEQIDNO:18. 

73. An isolated polypeptide of claim 72, wherein the polypeptide has an amino acid 
sequence as set forth in SEQ ID NO:14, SEQ ID NO:16, or SEQ ID NO:18. 

74. An isolated polypeptide of claim 70, wherein the polypeptide is a mammalian 
polypeptide. 
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75. An isolated polypeptide of claim 70, wherein the polypeptide is a human 
polypeptide. 

76. An isolated polypeptide of claim 70, wherein the polypeptide is a Drosophila 
polypeptide. 

77. An isolated polynucleotide encoding a polypeptide according to claim 70, or a 
functional peptide portion thereof. 

78. An isolated polynucleotide of claim 77, wherein the polynucleotide encodes a 
mammalian MICAL-like polypeptide. 

79. An isolated polynucleotide of claim 77, wherein the polynucleotide encodes a 
polypeptide that is at least 90% identical to an amino acid sequence as set forth in 

SEQ ID NO:14, SEQ ID NO:16, or SEQ ID NO:18. 

80. An isolated polynucleotide of claim 77, wherein the polynucleotide encodes a 
polypeptide comprising a calponin homology domain, followed by a first non-conserved 
region, followed by a LIM domain, followed by a second non-conserved region, followed 
by a proline rich region, and followed by a plexin interacting region. 

81. An isolated polynucleotide of claim 79, wherein the polynucleotide encodes a 
polypeptide having an amino acid sequence of SEQ ID NO: 14, SEQ ID NO: 16, or 

SEQ ID NO: 18. 

82. A vector comprising a polynucleotide of claim 77. 

83. A vector of claim 75, wherein the vector is a recombinant expression vector. 
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84. A recombinant host cell comprising the polynucleotide of claim 77 operably 
linked to a heterologous promoter. 

85. An isolated polynucleotide of claim 77, wherein the polynucleotide encodes a 
human polypeptide. 

86. An isolated polynucleotide of claim 77, wherein the polynucleotide encodes a 
human polypeptide. 

87. A non-human transgenic animal having a genome comprising a transgene 
comprising a nucleotide sequence encoding a MICAL polypeptide operably linked to a 
heterologous promoter, wherein the non-human transgenic animal expresses the transgenic 
polynucleotide in the central nervous system, and wherein expression levels of the 
transgenic polynucleotide are sufficient to effect an axonal guidance phenotype of the non- 
human organism. 

88. The non-human transgenic animal of claim 87, wherein the non-human 
transgenic animal is a mouse. 

89. The non-human transgenic animal of claim 87, wherein the MICAL polypeptide 
is ectopically expressed. 

90. The non-human transgenic animal of claim 87, wherein the MICAL polypeptide 
is expressed at a greater level in one or more cells of the non-human transgenic animal than 
the MICAL polypeptide is expressed in comparable cells of a comparable non-human 
transgenic animal. 

91. A non-human transgenic animal having a genome comprising a recombinantly 
inactivated nucleotide sequence encoding a MICAL polypeptide that has been 
recombinantly inactivated, wherein the non-human transgenic animal has an altered axon 
guidance phenotype. 
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92. The non-human transgenic animal of claim 91, wherein the non-human 
transgenic animal is a mouse. 

93. The non-human transgenic animal of claim 91, wherein the non-human 
transgenic animal is heterozygous for the nucleotide sequence that has been inactivated. 

94. The non-human transgenic animal of claim 91 , wherein the non-human 
transgenic animal is homozygous for the nucleotide sequence that has been recombinantly 
inactivated. 

95. A method for inhibiting axonal guidance regulatory activity, comprising 
contacting a cell that expresses a polypeptide of claim 1 with a monooxygenase inhibitor, 
thereby inhibiting axonal guidance regulatory activity. 

96. The method of claim 95, wherein the monooxygenase inhibitor is a flavonoid. 

97. The method of claim 96, wherein the flavonoid is a gallic acid derivative. 

98. The method of claim 97, wherein the gallic acid derivative is 
(-)-epigallocatechin gallate (EGCG), (-)-epicatachin (EC), (-)-gallocatechin-3-<9-gallate 
(GCG), (-)-epicatechin-3-0-gallate (ECG), (-)-epigallocatechin (EGG), (+)-gallocatechin 
(GC), theasinensin A, 3"-0-methyl-EGCG, 3"-0-methyl-ECG, 3"-0-methyl-GCG, (-)- 
epigallocatechin (EGG), (l)-gallocatechin (GC), gallic acid, catechin, rc-octyl gallate, or 
rc-cetyl gallate. 

99. The method of claim 95, wherein the contacting is performed in vitro. 

100. The method of claim 95, wherein the contacting is performed in vivo. 
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101. A method for affecting a semaphorin-mediated process, comprising contacting 
a cell that expresses a polypeptide of claim 1 with an effective amount of a monooxygenase 
inhibitor, thereby affecting the semaphorin-mediated process. 

102. The method of claim 101, wherein the agent inhibits semaphorin la-PlexA- 
mediated repulsive axon guidance. 

103. The method of claim 102, wherein the cell is a neuron. 

104. The method of claim 102, wherein the cell is an immune cell, a cancer cell, or 
a cardiac cell. 

105. The method of claim 101, wherein the monooxygenase inhibitor is a flavonoid. 

106. The method of claim 105, wherein the flavonoid is a gallic acid derivative. 

107. The method of claim 106, wherein the gallic acid derivative is 
(-)-epigallocatechin gallate (EGCG), (-)-epicatachin (EC), (-)-gallocatechin-3-0-gallate 
(GCG), (-)-epicatechin-3-<9-gallate (EGG), (-)-epigallocatechin (EGC), (+)-gallocatechin 
(GC), theasinensin A, 3"-Omethyl-EGCG, 3"-Omethyl-ECG, 3"-0-methyl-GCG, (-)- 
epigallocatechin (EGC), (l)-gallocatechin (GC), gallic acid, catechin, rc-octyl gallate, or 
tt-cetyl gallate. 

108. A method for treating a neurological condition in a subject in need thereof, the 
method comprising contacting in the subject, a cell of the central nervous system (CNS) or 
peripheral nervous system (PNS) having a disrupted axonal connection or a cell that affects 
axonal growth of the CNS cell or the PNS cell, with an amount of an agent that modulates 
the activity or expression of a MICAL polypeptide, the amount being effective to modulate 
axonal guidance or axon out-growth regulatory activity. 
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109. The method of claim 108, wherein the cell is contacted with the agent for a 
length of time effective for allowing axon regrowth. 

110. The method of claim 108, wherein the agent is applied for a an length of time 
sufficient to promote neurorestoration. 

111. The method of claim 108, wherein the agent is applied chronically. 

1 12. The method of claim 108, wherein the neurological condition is a spinal cord 

injury. 

113. The method of claim 108, wherein the neurological condition is traumatic 
brain injury. 

1 14. The method of claim 108, wherein the neurological condition is neuropathic 

pain. 

115. The method of claim 108, wherein the neurological condition is Parkinson's 
Disease. 

116. The method of claim 108, wherein the neurological condition is Amyotrophic 
Lateral Sclerosis. 

117. The method of claim 108, wherein the neurological condition is ischemic 

injury. 

118. The method of claim 108, wherein the neurological condition is Alzheimer's 
Disease. 

1 19. The method of claim 108, wherein the neurological condition is Multiple 
Sclerosis. 
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120. The method of claim 108, wherein the neurological condition is a neuropathy 
resulting from a stroke. 

121. The method of claim 108, wherein the agent is an anti-oxidant. 

122. The method of claim 121, wherein the anti-oxidant is a flavonoid. 

123. The method of claim 122, wherein the flavonoid is a gallic acid derivative. 

124. The method of claim 123, wherein the gallic acid derivative is 
(-)-epigallocatechin gallate (EGCG), (-)-epicatachin (EC), (-)-gallocatechin-3-0-gallate 
(GCG), (-)-epicatechin-3-0-gallate (ECG), (-)-epigallocatechin (EGC), (+)-gallocatechin 
(GC), theasinensin A, 3"-0-methyl~EGCG, 3"-<9-methyl-ECG, 3"-0-methyl-GCG, (-)- 
epigallocatechin (EGC), (l)-gallocatechin (GC), gallic acid, catechin, n-octyl gallate, or «- 
cetyl gallate. 

125. The method of claim 108, wherein the method further comprises contacting the 
cell with an agent that affects axon regeneration. 

126. The method of claim 125, wherein the effect is promotion of axon 
regeneration. 

127. The method of claim 126, wherein the agent is a neurotrophin, a mechanical 
bridge, or a stem cell. 

128. The method of claim 127, wherein the mechanical bridge is a cell engineered 
to express a neurotrophic factor, a growth factor, an axon outgrowth promoting molecule, or 
an artificial polymer-based substrate. 
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129. The method of claim 127, wherein the mechanical bridge comprises fetal 

tissue. 

130. The method of claim 127, wherein the mechanical bridge comprises a Schwann 
cell or an olfactory ensheathing glia. 

131. The method of claim 127, wherein the agent is a neurotrophic factor, a growth 
factor, an axon outgrowth promoting molecule, or an artificial polymer-based substrate. 

132. The method of claim 131, wherein the agent is NGF, BDNF, NT-3, or NT-4/5. 

133. The method of claim 131, wherein the agent is CNTF, GDNF, FGF, EGF, or 

PDGF. 

134. The method of claim 131, wherein the agent is netrin, laminin, or collagen. 

135. A method for treating a neurological disorder involving a failure of axon 
regrowth, comprising contacting a neuron having axons that fail to regrow, or surrounding 
tissue, with an agent that neutralizes oxidants for a length of time effective for allowing 
axon regrowth, thereby treating the neurological disorder. 

136. The method of claim 135, wherein the agent is applied for a length of time 
sufficient to promote neurorestoration. 

137. The method of claim 135, wherein the agent is applied for a length of time to 
diminish free radicals chronically after spinal cord injury. 

138. The method of claim 135, wherein the agent is an anti-oxidant. 

139. The method of claim 138, wherein the anti-oxidant is a flavonoid. 

140. The method of claim 139, wherein the flavonoid is a gallic acid derivative. 
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141 . The method of claim 140, wherein the gallic acid derivative is 
(-)-epigallocatechin gallate (EGCG) or (-)-epicatachin (EC). 

142. A method for inducing regrowth and preventing inhibition of an injured 
process of a neuron, comprising altering the levels of reactive oxygen species in the milieu 
of the neuron for an length of time effective for allowing the injured process to reach a 
target. 

143. The method of claim 142, wherein the levels of reactive oxygen species are 
altered for a length of time sufficient to promote neurorestoration. 

144. The method of claim 142, further comprising identifying a site that includes 
the neuron suspected of having an injured process, before altering the levels of reactive 
oxygen species in the milieu of the neuron. 

145. The method of claim 142, wherein the neuronal process is an axon. 

146. The method of claim 142, wherein the neuronal process is a dendrite. 

147. The method of claim 142, wherein levels of reactive oxygen species or other 
oxidation products are decreased. 

148. The method of claim 142, wherein the method further comprises adding an 
agent that promote neuron process regrowth to the milieu of the neuron. 

149. The method of claim 148, wherein the agent is a neurotrophin, a mechanical 
bridge, or a stem cell. 
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150. The method of claim 148, wherein levels of reactive oxygen species are 
lowered for at least 1 month. 

151. The method of claim 148, wherein levels of reactive oxygen species are 
lowered for at least 3 months. 

152. A method for limiting abnormal axon outgrowth, comprising contacting a 
neuron or the milieu of the neuron with an agent that affects oxidation state. 

153. The method of claim 150, wherein the abnormal axon outgrowth is excessive 
axon outgrowth. 
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cgagggtggg 


ccgtctgctt 


tcgttcccac 


aaagaaagca 


3300 


cctcacccca 


gcacagtgcc 


acccctgttc 


atctgggctg 


cctggcagag 


agccttgctg 


3360 


tttacaatta 


aaatgtttct 


gccac 








3385 



<210> 2 

<211> 863 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Ala Gly Pro Arg Gly Ala Leu Leu Ala Trp Cys Arg Arg Gin Cys 
15 10 15 

Glu Gly Tyr Arg Gly Val Glu lie Arg Asp Leu Ser Ser Ser Phe Arg 
20 25 30 

Asp Gly Leu Ala Phe Cys Ala lie Leu His Arg His Arg Pro Asp Leu 
35 40 45 

Leu Asp Phe Asp Ser Leu Ser Lys Asp Asn Val Phe Glu Asn Asn Arg 
50 55 60 

Leu Ala Phe Glu Val Ala Glu Lys Glu Leu Gly lie Pro Ala Leu Leu 
65 70 75 80 

Asp Pro Asn Asp Met Val Ser Met Ser Val Pro Asp Cys Leu Ser lie 
85 90 ~ ~ 95 

Met Thr Tyr Val Ser Gin Tyr Tyr Asn His Phe Cys Ser Pro Gly Gin 
100 105 110 
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Ala Gly Val Ser Pro Pro Arg Lys Gly Leu Ala Pro Cys Ser Pro Pro 
115 120 125 



Ser Val Ala Pro Thr Pro Val Glu Pro Glu Asp Val Ala Gin Gly Glu 
130 135 140 



Glu Leu Ser Ser Gly Ser Leu Ser Glu Gin Gly Thr Gly Gin Thr Pro 
145 150 155 160 



Ser Ser Thr Cys Ala Ala Cys Gin Gin His Val His Leu Val Gin Arg 
165 170 175 



Tyr Leu Ala Asp Gly Arg Leu Tyr His Arg His Cys Phe Arg Cys Arg 
180 185 190 



Arg Cys Ser Ser Thr Leu Leu Pro Gly Ala Tyr Glu Asn Gly Pro Glu 
195 200 205 



Glu Gly Thr Phe Val Cys Ala Glu His Cys Ala Arg Leu Gly Pro Gly 
210 215 220 



Thr Arg Ser Gly Thr Arg Pro Gly Pro Phe Ser Gin Pro Lys Gin Gin 
225 230 235 240 



His Gin Gin Gin Leu Ala Glu Asp Ala Lys Asp Val Pro Gly Gly Gly 
245 250 255 



Pro Ser Ser Ser Ala Pro Ala Gly Ala Glu Ala Asp Gly Pro Lys Ala 
260 265 270 



Ser Pro Glu Ala Arg Pro Gin lie Pro Thr Lys Pro Arg Val Pro Gly 
275 280 285 



Lys Leu Gin Glu Leu Ala Ser Pro Pro Ala Gly Arg Pro Thr Pro Ala 
290 295 300 



Pro Arg Lys Ala Ser Glu Ser Thr Thr Pro Ala Pro Pro Thr Pro Arg 
305 310 315 320 



Pro Arg Ser Ser Leu Gin Gin Glu Asn Leu Val Glu Gin Ala Gly Ser 
325 330 335 



Ser Ser Leu Val Asn Gly Arg Leu His Glu Leu Pro Val Pro Lys Pro 
340 345 350 



Arg Gly Thr Pro Lys Pro Ser Glu Gly Thr Pro Ala Pro Arg Lys Asp 
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355 360 365 



Pro Pro Trp lie Thr Leu Val Gin Ala Glu Pro Lys Lys Lys Pro Ala 
370 375 380 



Pro Leu Pro Pro Ser Ser Ser Pro Gly Pro Pro Ser Gin Asp Ser Arg 
385 390 395 400 



Gin Val Glu Asn Gly Gly Thr Glu Glu Val Ala Gin Pro Ser Pro Thr 
405 410 415 



Ala Ser Leu Glu Ser Lys Pro Tyr Asn Pro Phe Glu Glu Glu Glu Glu 
420 425 430 



Asp Lys Glu Glu Glu Ala Pro Ala Ala Pro Ser Leu Ala Thr Ser Pro 
435 440 445 



Ala Leu Gly His Pro Glu Ser Thr Pro Lys Ser Leu His Pro Trp Tyr 
450 455 460 



Gly lie Thr Pro Thr Ser Ser Pro Lys Thr Lys Lys Arg Pro Ala Pro 
465 470 475 480 



Arg Ala Pro Ser Ala Ser Pro Leu Ala Leu His Ala Ser Arg Leu Ser 
485 490 495 



His Ser Glu Pro Pro Ser Ala Thr Pro Ser Pro Ala Leu Ser Val Glu 
500 505 510 



Ser Leu Ser Ser Glu Ser Ala Ser Gin Thr Ala Gly Ala Glu Leu Leu 
515 520 525. 



Glu Pro Pro Ala Val Pro Lys Ser Ser Ser Glu Pro Ala Val His Ala 
530 535 540 



Pro Gly Thr Pro Gly Asn Pro Val Ser Leu Ser Thr Asn Ser Ser Leu 
545 550 555 560 



Ala Ser Ser Gly Glu Leu Val Glu Pro Arg Val Glu Gin Met Pro Gin 
565 570 575 



Ala Ser Pro Gly Leu Ala Pro Arg Thr Arg Gly Ser Ser Gly Pro Gin 
580 585 590 



Pro Ala Lys Pro Cys Ser Gly Ala Thr Pro Thr Pro Leu Leu Leu Val 
595 600 605 
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Gly Asp Arg Ser Pro Val Pro Ser Pro Gly Ser Ser Ser Pro Gin Leu 
610 615 620 



Gin Val Lys Ser Ser Cys Lys Glu Asn Pro Phe Asn Arg Lys Pro Ser 
625 630 635 640 



Pro Ala Ala Ser Pro Ala Thr Lys Lys Ala Thr Lys Gly Ser Lys Pro 
645 650 655 



Val Arg Pro Pro Ala Pro Gly His Gly Phe Pro Leu He Lys Arg Lys 
660 665 670 



Val Gin Ala Asp Gin Tyr He Pro Glu Glu Asp He His Gly Glu Met 
675 680 685 



Asp Thr He Glu Arg Arg Leu Asp Ala Leu Glu His Arg Gly Val Leu 
690 695 700 



Leu Glu Glu Lys Leu Arg Gly Gly Leu Asn Glu Gly Arg Glu Asp Asp 
705 710 715 720 



Met Leu Val Asp Trp Phe Lys Leu He His Glu Lys His Leu Leu Val 
725 730 735 



Arg Arg Glu Ser Glu Leu He Tyr Val Phe Lys Gin Gin Asn Leu Glu 
740 745 750 



Gin Arg Gin Ala Asp Val Glu Tyr Glu Leu Arg Cys Leu Leu Asn Lys 
755 760 765 



Pro Glu Lys Asp Trp Thr Glu Glu Asp Arg Ala Arg Glu Lys Val Leu 
770 775 780 



Met Gin Glu Leu Val Thr Leu He Glu Gin Arg Asn Ala He He Asn 
785 790 795 800 



Cys Leu Asp Glu Asp Arg Gin Arg Glu Glu Glu Glu Asp Lys Met Leu 
805 810 815 



Glu Ala Met He Lys Lys Lys Glu Phe Gin Arg Glu Ala Glu Pro Glu 
820 825 830 



Gly Lys Lys Lys Gly Lys Phe Lys Thr Met Lys Met Leu Lys Leu Leu 
835 840 845 
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Gly Asn Lys Arg Asp Ala Lys Ser Lys Ser Pro Arg Asp Lys Ser 
850 855 860 

<210> 3 

<211> 6008 

<212> DNA 

<213> Homo sapiens 

<400> 3 



ccgggccgcc 


tcgctcgctc 


ccagctctgt 


cagtggcccg 


cggggcccga 


tcgctgcgcc 


60 


cgcggccagg 


gccgaggcag 


gcctgacccg 


gggccgggca 


gcccgcgcga 


ettteggaac 


120 


atggcaaccc 


gtgtgtgtct 


catcccagaa 


agagaagact 


ttaaccactg 


tgatgcctga 


180 


gaatccagtg 


tgacgtttct 


ccagatactt 


catgctgttc 


acctgtgtcc 


tcgccgcacc 


240 


actgccgcac 


acgactcctg 


aaccatgggg 


gaaaacgagg 


atgagaagca 


ggcccaggcg 


300 


gggcaggttt 


ttgagaactt 


tgtccaggca 


tccacgtgca 


aaggtaccct 


ccaggccttc 


360 


aacattctca 


cacgacacct 


ggacctagac 


cctctggacc 


acagaaactt 


ttattccaag 


420 


ctcaagtcca 


aggtgaccac 


ctggaaagcc 


aaagccctgt 


ggtacaaatt 


ggataagcgt 


480 


ggttcccaca 


aagagtataa 


gcgagggaag 


tcgtgcacga 


acaccaagtg 


tctcatagtt 


540 


gggggaggac 


cctgtggctt 


gcgcactgcc 


attgaacttg 


cctacctggg 


agecaaagtg 


600 


gtcgtggtgg 


agaagaggga 


ctccttctcc 


cggaacaacg 


tgctacacct 


ctggcctttc 


660 


accatccatg 


accttcgggg 


gg tgggagcc 


aagaagttct 


atgggaagtt 


ctgtgctggc 


720 


tccatcgacc 


atatcagtat 


tcgccaacta 


cagctcatcc 


tattcaaggt 


ggccctgatg 


780 


ctgggagttg 


aaatccatgt 


gaatgtggag 


ttcgtgaagg 


ttctagagee 


tcctgaagat 


840 


caagaaaatc 


aaaaaattgg 


ctggcgggca 


gaatttctcc 


ctacagacca 


ttctctgtcg 


900 




t* t" rr^i f**rri~ i~ 

I— Ly Cll_.y L.UCIL. 


P3 f f r~rn"f~~ rrnp 






yyciciyyy i— l-l- 


960 


agaagaaaag 


aattccgtgg 


gaagctggcg 


attgccatca 


ccgccaactt 


cataaacaga 


1020 


aacagcacag 


cggaagccaa 


ggtggaagag 


attagtggtg 


tggctttcat 


cttcaatcag 


1080 


aaattttttc 


aggaccttaa 


agaagaaaca 


ggc at agate 


ttgagaacat 


tgtttactac 


1140 


aaggactgca 


cccactattt 


tgtaatgaca 


gecaagaage 


agagectget 


cgacaaaggt 


1200 


gtcatcatta 


acgactacat 


cgacacagag 


atgctgctgt 


gtgcggagaa 


cgtgaaccaa 


1260 


gacaacctgc 


tatcctatgc 


ccgggaagct 


gcagactttg 


ccaccaacta 


ccagctgcca 


1320 


tccttagact 


ttgccatgaa 


ccactatggg 


cagectgatg 


tggccatgtt 


tgactttacc 


1380 


tgcatgtatg 


cctcagagaa 


cgcggccctg 


gtgegggage 


ggcaggegea 


ccagctgctc 


1440 


gtggcccttg 


tgggtgacag 


cttgcttgag 


ccattttggc 


ccatgggtac 


aggctgtgcc 


1500 
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cgtggcttcc 


tggcagcctt 


tgacacggca 


tggatggtga 


agagctggaa 


ccagggcacc 


1560 


cctcccctgg 


agctgctggc 


tgaaagggaa 


agtctctacc 


ggctgttacc 


tcagacaacc 


1620 


ccggagaaca 


tcaacaagaa 


ctttgagcag 


tacacgttgg 


acccagggac 


acggtaccca 


1680 


aacctcaact 


cacactgtgt 


caggccccat 


caggtgaagc 


atttgtatat 


cactaaggag 


1740 


ctggagcact 


accctctcga 


gagactgggc 


tcggtgagga 


gatctgtcaa 


cctctccagg 


1800 


aaggagtcag 


atatccggcc 


cagcaagctc 


ctgacctggt 


gccagcagca 


gacagagggc 


1860 


taccagcatg 


tcaacgtcac 


cgacctgacc 


acatcctggc 


gcagtgggtt 


ggccctgtgt 


1920 


gccatcatcc 


accgcttccg 


gcctgagctc 


atcaactttg 


actctttgaa 


tgaagatgat 


1980 


gctgtggaga 


ac'aaccagct 


cgcatttgat 


gtggccgagc 


gagagtttgg 


gatccctcca 


2040 


gtgaccacgg 


gcaaagagat 


ggcatctgcc 


caggagcctg 


acaagctcag 


catggtcatg 


2100 


tacctctcca 


agttctacga 


gctcttccgg 


ggcaccccac 


tgaggcccgt 


ggattcttgg 


2160 


cgcaaaaact 


atggagaaaa 


tgctgacctc 


agcttggcca 


aatcatccat 


ttctaataac 


2220 


tatctcaacc 


tcacatttcc 


aaggaagagg 


actccacggg 


tggatggtca 


aaccggagag 


2280 


aatgacatga 


acaaacggag 


acggaagggc 


ttcaccaacc 


tggacgagcc 


ttcaaacttt 


2340 


tccagccgta 


gcttgggctc 


caatcaagag 


tgtgggagca 


gtaaggaagg 


tggaaatcag 


2400 


aacaaagtca 


agtccatggc 


gaatcagctg 


ctggccaagt 


ttgaggagag 


cactcggaac 


2460 


ccctcactca 


tgaagcagga 


acgccgtgtc 


tcagggatag 


gtaagccggt 


cctgtgctct 


2520 


tcctccggcc 


ctcctgttca 


ctcttgctgc 


cccaagccgg 


aggaggccac 


acccagccca 


2580 


tcacctcctc 


tgaaaaggca 


gttcccctct 


gtggtcgtga 


cggggcacgt 


gctcagagag 


2640 


ctcaagcaag 


tgtctgctgg 


cagtgagtgc 


ctgagcagac 


cttggagagc 


cagagccaag 


2700 


tctgacctac 


agctgggtgg 


gacagaaaat 


ttcgctaccc 


tgccttctac 


ccgcccgagg 


2760 


gcgcaggctc 


tttccggggt 


gctgtggcgg 


ctgcagcaag 


tggaggaaaa 


gattctccag 


2820 


aagagggctc 


agaacttggc 


caacagggaa 


tttcacacaa 


agaacattaa 


ggagaaggcg 


2880 


gctcaccttg 


cctccatgtt 


tggacacggg 


gatttcccgc 


agaataaact 


gctctctaaa 


2940 


ggcctgtctc 


atactcatcc 


tccatctcct 


ccctctcgcc 


ttccgtctcc 


tgatccagct 


3000 


gcttcttcct 


ctccatcaac 


tgttgactct 


gcttctcctg 


ccagaaagga 


aaagaagtca 


3060 


ccttcagggt 


tccattttca 


tcccagccat 


ttgagaacag 


tgcatcctca 


gctgacggta 


3120 


gggaaagtgt 


ccagcggaat 


aggggctgca 


gctgaagtcc 


tggtcaatct 


gtacatgaat 


3180 


gatcacagac 


ctaaggccca 


ggccacctct 


ccagacctgg 


aatctatgcg 


aaagtcattt 


3240 


ccccttaacc 


tgggaggcag 


cgacacgtgt 


tacttctgta 


agaaacgtgt 


gtacgtgatg 


3300 


gaacggctga 


gcgccgaggg 


ccacttcttc 


caccgggagt 


gtttccgctg 


cagcatctgt 


3360 
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gccaccacct 


tgcgcctggc 


cgcctacacc 


tttgactgcg 


atgaaggcaa 


attttactgc 


3420 


aagcctcact 


tcattcactg 


taaaaccaat 


agcaaacaac 


ggaagagacg 


ggcagagttg 


3480 


aagcaacaaa 


gagaggagga 


ggcaacatgg 


caagagcagg 


aagcccctcg 


gagagacact 


3540 


cccaccgaaa 


gttcttgcgc 


agtggccgcc 


attggcaccc 


tggaaggcag 


ccccccagtt 


3600 


catttcagcc 


ttccagtgct 


acacccactt 


cttggccacc 


ccatctgggg 


gaaggacagg 


3660 


agctggacag 


gccaagagct 


atctcccttg 


gctggagaag 


accgggaaaa 


agggagtact 


3720 


ggagccagga 


aggaagaaga 


gggagggcca 


gtgctggtaa 


aggagaagtt 


gggcctgaag 


3780 


aagttagtcc 


tcacccagga 


gcagaagacc 


atgttgttgg 


attggaatga 


ctccatccct 


3840 


gagagtgtgc 


acctcaaagc 


tggggagcga 


atttcccaga 


aaagtgctga 


gaatggtaga 


3900 


ggaggccgtg 


tgctaaaacc 


agtccgcccc 


ctgctgctcc 


ctagggcagc 


aggagagccc 


3960 


ctgccaaccc 


agagaggggc 


tcaggagaag 


atggggaccc 


ctgcggaaca 


agctcaaggg 


4020 


gagcgaaacg 


tgcctccacc 


caagtcccca 


ctgcggctca 


tagccaatgc 


catccgaagg 


4080 


tctctagagc 


ccctcctttc 


caactctgaa 


ggcgggaaga 


aggcctgggc 


caagcaagaa 


4140 


tccaaaactt 


tgcccacaca 


ggcctgcact 


cgctcattca 


gccttcggaa 


aaccaattcc 


4200 


aataaagacg 


gggaccagca 


ttcccctggg 


agaaaccagt 


cctcagcctt 


tagccctcct 


4260 


gaccctgccc 


tccgcaccca 


cagtttgccc 


aatcggccat 


ccaaggtctt 


tcctgcactt 


4320 


aggtccccac 


cctgcagcaa 


gattgaagaa 


gtccccacac 


tcctcgagaa 


agtgagtttg 


4380 


caagagaact 


tcccagatgc 


ttctaagcct 


ccaaagaaaa 


gaatctcact 


tttttcctcc 


4440 


ctcagactca 


aagacaaatc 


ttttgagagt 


ttcctccaag 


aatccagaca 


aagaaaggac 


4500 


atcagggacc 


tctttggcag 


ccccaagagg 


aaggtgctgc 


ctgaagatag 


tgcgcaggcc 


4560 


ctggagaagc 


tgctgcagcc 


tttcaaaagc 


acctccctgc 


gccaggcagc 


tcctcctcct 


4620 


cctcctcctc 


ctcctcctcc 


tcctcctcct 


cctcctcccc 


ctacagcggg 


aggtgcagac 


4680 


tccaagaact 


ttcccctcag 


agcacaggta 


acagaggctt 


cctcttctgc 


ctcttcaacc 


4740 


tcctcctcct 


ctgcagatga 


agaatttgat 


ccccagcttt 


ccttgcagtt 


aaaggagaag 


4800 


aagacactta 


gaagaagaaa 


gaagctagaa 


aaagcaatga 


agcagttggt 


taagcaagaa 


4860 


gaattgaaaa 


gactctataa 


acre t c acr ci c c 


atccagaggc 


aQctacraQcra 


cicrtcfQaacfacf 


4920 


cggcagaggg 


cttctgagat 


ccagggtgtg 


aggctggaga 


aggcgttgcg 


aggagaagca 


4980 


gattcaggca 


cacaggatga 


agcacagctt 


ttgcaggaat 


ggtttaagct 


ggttctggag 


5040 


aagaataaat 


taatgcgata 


tgagtcggag 


ctcctaatca 


tggcccagga 


actggaatta 


5100 


gaagatcatc 


aaagcagact 


ggagcagaaa 


ctgagagaga 


aaatgctcaa 


99 a ggagagc 


5160 
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cagaaagatg 


agaaggatct 


aaacgaagag 


caagaagtat 


tcaccgagct 


gatgcaagtg 


5220 


attgagcaaa 


gggacaaact 


cgtcgattcc 


ttagaggaac 


aacgcatcag 


agaaaaagcc 


5280 


gaggaccagc 


actttgaaag 


cttcgtattc 


tccagaggct 


gtcagctgag 


caggacttga 


5340 


ggaggcccgt 


agtccctctc 


cctggctgca 


cgttgggacc 


ggatcaggcc 


aagtgcacca 


5400 


cacaccctca 


tgggtctttc 


tgcaggattt 


atcatccctg 


gaacttaagt 


tataccttac 


5460 


tgcatttttt 


aaaattaaaa 


ttctcttgca 


cgcatggcag 


cttcccaagg 


ttcttccaga 


5520 


gattcaaatg 


aagaaaaccc 


aaagactctt 


tggcaattgg 


cagtcaactt 


cagccaggct 


5580 


ctcagactgg 


aggtgttgtt 


ggcagatgac 


cagcattgtt 


ttccctagaa 


agtgacacaa 


5640 


agacttgact 


ttcctgctac 


ttttatcatt 


ttccttccca 


attcattgag 


ttacatactt 


5700 


taagattttt 


gagaagctgc 


cttttcatta 


atatatccat 


atttgccttt 


ttttgtatgg 


5760 


atgaccagtt 


tccaaatgtc 


agaaagaagc 


agccgcagtt 


taaagattag 


gttaatattt 


5820 


aaattgtgtt 


tccagagaaa 


gaggagaaac 


cttgagatta 


ctgattacat 


aaagcaaata 


5880 


actcatatag 


caggtgttaa 


ttcaatccag 


ggtgaattta 


atttaccagg 


tgcatttata 


5940 


agccttaata 


taatatacat 


aagcaatgag 


agcttaatag 


aacatttgag 


ccttaatttt 


6000 


attttaag 












6008 



<210> 4 

<211> 1633 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Gly Glu Asn Glu Asp Glu Lys Gin Ala Gin Ala Gly Gin Val Phe 
15 10 15 



Glu Asn Phe Val Gin Ala Ser Thr Cys Lys Gly Thr Leu Gin Ala Phe 
20 25 30 



Asn lie Leu Thr Arg His Leu Asp Leu Asp Pro Leu Asp His Arg Asn 
35 40 45 



Phe Tyr Ser Lys Leu Lys Ser Lys Val Thr Thr Trp Lys Ala Lys Ala 
50 55 60 



Leu Trp Tyr Lys Leu Asp Lys Arg Gly Ser His Lys Glu Tyr Lys Arg 
65 70 75 80 



Gly Lys Ser 



Cys Thr Asn Thr Lys Cys Leu lie 
85 90 



Val Gly Gly Gly Pro 
95 
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Cys Gly Leu Arg Thr Ala lie Glu Leu Ala Tyr Leu Gly Ala Lys Val 
100 105 110 



Val Val Val Glu Lys Arg Asp Ser Phe Ser Arg Asn Asn Val Leu His 
115 120 125 



Leu Trp Pro Phe Thr lie His Asp Leu Arg Gly Leu Gly Ala Lys Lys 
130 135 140 



Phe Tyr Gly Lys Phe Cys Ala Gly Ser lie Asp His lie Ser lie Arg 
145 150 155 160 



Gin Leu Gin Leu lie Leu Phe Lys Val Ala Leu Met Leu Gly Val Glu 
165 170 175 



He His Val Asn Val Glu Phe Val Lys Val Leu Glu Pro Pro Glu Asp 
180 185 190 



Gin Glu Asn Gin Lys He Gly Trp Arg Ala Glu Phe Leu Pro Thr Asp 
195 200 205 



His Ser Leu Ser Glu Phe Glu Phe Asp Val He He Gly Ala Asp Gly 
210 215 220 



Arg Arg Asn Thr Leu Glu Gly Phe Arg Arg Lys Glu Phe Arg Gly Lys 
225 230 235 240 



Leu Ala He Ala He Thr Ala Asn Phe He Asn Arg Asn Ser Thr Ala 
245 250 255 



Glu Ala Lys Val Glu Glu He Ser Gly Val Ala Phe He Phe Asn Gin 
260 265 270 



Lys Phe Phe Gin Asp Leu Lys Glu Glu Thr Gly He Asp Leu Glu Asn 
275 280 285 



He Val Tyr Tyr Lys Asp Cys Thr His Tyr Phe Val Met Thr Ala Lys 
290 295 300 



Lys Gin Ser Leu Leu Asp Lys Gly Val He He Asn Asp Tyr He Asp 
305 310 315 320 



Thr Glu Met Leu Leu Cys Ala Glu Asn Val Asn Gin Asp Asn Leu Leu 
325 330 335 
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Ser Tyr Ala Arg Glu Ala Ala Asp Phe Ala Thr Asn Tyr Gin Leu Pro 
• 3 ^0 345 ' 350 

Ser Leu Asp Phe Ala Met Asn His Tyr Gly Gin Pro Asp Val Ala Met 
355 360 365 

Phe Asp Phe Thr Cys Met Tyr Ala Ser Glu Asn Ala Ala Leu Val Arg 
370 375 380 

Glu Arg Gin Ala His Gin Leu Leu Val Ala Leu Val Gly Asp Ser Leu 
385 3 90 395 ^ 1 400 

Leu Glu Pro Phe Trp Pro Met Gly Thr Gly Cys Ala Arg Gly Phe Leu 
405 410 415 

Ala Ala Phe Asp Thr Ala Trp Met Val Lys Ser Trp Asn Gin Gly Thr 
420 425 ^ 430 

Pro Pro Leu Glu Leu Leu Ala Glu Arg Glu Ser Leu Tyr Arg Leu Leu 
435 440 445 

Pro Gin Thr Thr Pro Glu Asn lie Asn Lys Asn Phe Glu Gin Tyr Thr 
450 455 460 

Leu Asp Pro Gly Thr Arg Tyr Pro Asn Leu Asn Ser His Cys Val Arg 
465 470 475 ~ 480 

Pro His Gin Val Lys His Leu Tyr He Thr Lys Glu Leu Glu His Tyr 
485 490 495 

Pro Leu Glu Arg Leu Gly Ser Val Arg Arg Ser Val Asn Leu Ser Arg 
500 505 510 

Lys Glu Ser Asp He Arg Pro Ser Lys Leu Leu Thr Trp Cys Gin Gin 
515 520 525 

Gin Thr Glu Gly Tyr Gin His Val Asn Val Thr Asp Leu Thr Thr Ser 
530 535 540 

Trp Arg Ser Gly Leu Ala Leu Cys Ala He He His Arg Phe Arg Pro 
545 550 555 " 560 

Glu Leu He Asn Phe Asp Ser Leu Asn Glu Asp Asp Ala Val Glu Asn 
565 570 575 



Asn Gin Leu Ala Phe Asp Val Ala Glu Arg Glu Phe Gly He Pro Pro 
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580 585 590 



Val Thr Thr Gly Lys Glu Met Ala Ser Ala Gin Glu Pro Asp Lys Leu 
595 600 605 



Ser Met Val Met Tyr Leu Ser Lys Phe Tyr Glu Leu Phe Arg Gly Thr 
610 615 620 



Pro Leu Arg Pro Val Asp Ser Trp Arg Lys Asn Tyr Gly Glu Asn Ala 
625 630 635 640 



Asp Leu Ser Leu Ala Lys Ser Ser lie Ser Asn Asn Tyr Leu Asn Leu 
645 650 655 



Thr Phe Pro Arg Lys Arg Thr Pro Arg Val Asp Gly Gin Thr Gly Glu 
660 665 . 670 



Asn Asp Met Asn Lys Arg Arg Arg Lys Gly Phe Thr Asn Leu Asp Glu 
675 680 685 



Pro Ser Asn Phe Ser Ser Arg Ser Leu Gly Ser Asn Gin Glu Cys Gly 
690 695 700 



Ser Ser Lys Glu Gly Gly Asn Gin Asn Lys Val Lys Ser Met Ala Asn 
705 710 715 ~ 720 



Gin Leu Leu Ala Lys Phe Glu Glu Ser Thr Arg Asn Pro Ser Leu Met 
725 730 735 



Lys Gin Glu Arg Arg Val Ser Gly lie Gly Lys Pro Val Leu Cys Ser 
740 745 750 



Ser Ser Gly Pro Pro Val His Ser Cys Cys Pro Lys Pro Glu Glu Ala 
755 760 765 



Thr Pro Ser Pro Ser Pro Pro Leu Lys Arg Gin Phe Pro Ser Val Val 
770 775 780 



Val Thr Gly His Val Leu Arg Glu Leu Lys Gin Val Ser Ala Gly Ser 
785 790 795 800 



Glu Cys Leu Ser Arg Pro Trp Arg Ala Arg Ala Lys Ser Asp Leu Gin 
805 810 815 



Leu Gly Gly Thr Glu Asn Phe Ala Thr Leu Pro Ser Thr Arg Pro Arg 
820 825 830 
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Ala Gin Ala Leu Ser Gly Val Leu Trp Arg Leu Gin Gin Val Glu Glu 
835 840 845 



Lys He Leu Gin Lys Arg Ala Gin Asn Leu Ala Asn Arg Glu Phe His 
850 855 860 



Thr Lys Asn He Lys Glu Lys Ala Ala His Leu Ala Ser Met Phe Gly 
865 870 875 880 



His Gly Asp Phe Pro Gin Asn Lys Leu Leu Ser Lys Gly Leu Ser His 
885 890 895 



Thr His Pro Pro Ser Pro Pro Ser Arg Leu Pro Ser Pro Asp Pro Ala 
900 905 910 



Ala Ser Ser Ser Pro Ser Thr Val Asp Ser Ala Ser Pro Ala Arg Lys 
915 920 925 



Glu Lys Lys Ser Pro Ser Gly Phe His Phe His Pro Ser His Leu Arg 
930 935 940 



Thr Val His Pro Gin Leu Thr Val Gly Lys Val Ser Ser Gly He Gly 
945 950 955 960 



Ala Ala Ala Glu Val Leu Val Asn Leu Tyr Met Asn Asp His Arg Pro 
965 970 975 



Lys Ala Gin Ala Thr Ser Pro Asp Leu Glu Ser Met Arg Lys Ser Phe 
980 985 990 



Pro Leu Asn Leu Gly Gly Ser Asp Thr Cys Tyr Phe Cys Lys Lys Arg 
995 1000 1005 



Val Tyr Val Met Glu Arg Leu Ser Ala Glu Gly His Phe Phe His 
1010 1015 1020 



Arg Glu Cys Phe Arg Cys Ser He Cys Ala Thr Thr Leu Arg Leu 
1025 1030 1035 



Ala Ala Tyr Thr Phe Asp Cys Asp Glu Gly Lys Phe Tyr Cys Lys 
1040 1045 1050 



Pro His Phe He His Cys Lys Thr Asn Ser Lys Gin Arg Lys Arg 
1055 1060 1065 
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Arg Ala Glu Leu Lys Gin Gin Arg Glu Glu Glu Ala Thr Trp Gin 
1070 1075 1080 

Glu Gin Glu Ala Pro Arg Arg Asp Thr Pro Thr Glu Ser Ser Cys 
1085 1090 1095 

Ala Val Ala Ala He Gly Thr Leu Glu Gly Ser Pro Pro Val His 
1100 1105 1110 

Phe Ser Leu Pro Val Leu His Pro Leu Leu Gly Met Leu Leu Asp 
1115 1120 1125 

Trp Asn Asp Ser He Pro Glu Ser Val His Leu Lys Ala Gly Glu 
1130 1135 H40 

Arg He Ser Gin Lys Ser Ala Glu Asn Gly Arg Gly Gly Arg Val 
1145 115 0 H55 

Leu Lys Pro Val Arg Pro Leu Leu Leu Pro Arg Ala Ala Gly Glu 
1160 1165 1170 

Pro Leu Pro Thr Gin Arg Gly Ala Gin Glu Lys Met Gly Thr Pro 
1175 iiso H85 

Ala Glu Gin Ala Gin Gly Glu Arg Asn Val Pro Pro Pro Lys Ser 
119° H95 1200 

Pro Leu Arg Leu He Ala Asn Ala He Arg Arg Ser Leu Glu Pro 
1205 1210 1215 

Leu Leu Ser Asn Ser Glu Gly Gly Lys Lys Ala Trp Ala Lys Gin 
1220 1225 1230 

Glu Ser Lys Thr Leu Pro Thr Gin Ala Cys Thr Arg Ser Phe Ser 
1235 1240 1245 

Leu Arg Lys Thr Asn Ser Asn Lys Asp Gly Asp Gin His Ser Pro 
1250 1255 1260 

Gly Arg Asn Gin Ser Ser Ala Phe Ser Pro Pro Asp Pro Ala Leu 
1265 1270 1275 



Arg Thr His Ser Leu Pro Asn Arg Pro Ser Lys Val Phe Pro Ala 
1280 1285 1290 
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Leu Arg Ser Pro Pro Cys Ser Lys lie Glu Glu Val Pro Thr Leu 
12 ^5 1300 - 1305 

Leu Glu Lys Val Ser Leu Gin Glu Asn Phe Pro Asp Ala Ser Lys 
1310 1315 1320 

Pro Pro Lys Lys Arg lie Ser Leu Phe Ser Ser Leu Arg Leu Lys 
1325 1330 1335 

Asp Lys Ser Phe Glu Ser Phe Leu Gin Glu Ser Arg Gin Arg Lys 
1340 1345 1350 

Asp lie Arg Asp Leu Phe Gly Ser Pro Lys Arg Lys Val Leu Pro 
1355 1360 ~ 1365 

Glu Asp Ser Ala Gin Ala Leu Glu Lys Leu Leu Gin Pro Phe Lys 
1370 1375 1380 

Ser Thr Ser Leu Arg Gin Ala Ala Pro Pro Pro Pro Pro Pro Pro 
1385 1390 1395 

Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Thr Ala Gly Gly Ala 
1400 1405 1410 

Asp Ser Lys Asn Phe Pro Leu Arg Ala Gin Val Thr Glu Ala Ser 
1415 1420 1425 

Ser Ser Ala Ser Ser Thr Ser Ser Ser Ser Ala Asp Glu Glu Phe 
1430 1435 1440 

Asp Pro Gin Leu Ser Leu Gin Leu Lys Glu Lys Lys Thr Leu Arg 
1445 1450 ~ 1455 

Arg Arg Lys Lys Leu Glu Lys Ala Met Lys Gin Leu Val Lys Gin 
1460 1465 1470 

Glu Glu Leu Lys Arg Leu Tyr Lys Ala Gin Ala lie Gin Arg Gin 
1475 1480 1485 

Leu Glu Glu Val Glu Glu Arg Gin Arg Ala Ser Glu He Gin Gly 
1490 1495 1500 

Val Arg Leu Glu Lys Ala Leu Arg Gly Glu Ala Asp Ser Gly Thr 
1505 1510 1515 



Gin Asp Glu Ala Gin Leu Leu Gin Glu Trp Phe Lys Leu Val Leu 
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1520 1525 1530 

Glu Lys Asn Lys Leu Met Arg Tyr Glu Ser Glu Leu Leu lie Met 
1535 1540 1545 

Ala Gin Glu Leu Glu Leu Glu Asp His Gin Ser Arg Leu Glu Gin 
1550 1555 1560 

Lys Leu Arg Glu Lys Met Leu Lys Glu Glu Ser Gin Lys Asp Glu 
1565 1570 1575 

Lys Asp Leu Asn Glu Glu Gin Glu Val Phe Thr Glu Leu Met Gin 
1580 1585 1590 

Val He Glu Gin Arg Asp Lys Leu Val Asp Ser Leu Glu Glu Gin 
1595 1600 1605 

Arg He Arg Glu Lys Ala Glu Asp Gin His Phe Glu Ser Phe Val 
1610 1615 1620 

Phe Ser Arg Gly Cys Gin Leu Ser Arg Thr 
1625 1630 

<210> 5 

<211> 8977 

<212> DNA 

<213> Homo sapiens 

<400> 5 

atggaggaga ggaagcatga gaccatgaac ccagctcatg tcctctttga ccggtttgtc 6 0 

caggccacca cctgcaaggg aaccctcaag gctttccagg agctctgtga ccacctggaa 12 0 

ctaaagccaa aggactaccg ctccttctat cacaagctca agtccaagct taactactgg 18 0 

aaagccaaag ccctctgggc aaaattggac aaacggggca gtcacaaaga ctacaaaaag 24 0 

ggaaaagcgt gcactaacac caagtgtctc atcattgggg ctggcccctg tggtctccgt 300 

acagccatcg acttatcctt actgggggcc aaggtggttg ttattgagaa acgagatgcc 360 

ttctcccgca acaacgtctt gcatctctgg ccattcacca tacatgatct acgaggtctg 420 

ggtgccaaga agttctatgg caagttctgt gctggagcca tcgaccatat cagtatccgt 480 

cagctccaac taatactttt gaaagtagcc ttgatcctag gcattgaaat ccacgtcaat 54 0 

.gtggaattcc aaggacttat acagcctcct gaggaccaag agaatgaacg gataggctgg 600 

cgggcactgg tgcaccccaa gactcatcct gtgtcagagt atgaatttga agtgatcatc 660 

9gtggggatg gtcggaggaa caccttggaa gggtttcgtc ggaaagaatt ccgtggcaaa 72 0 
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ctggccatcg ccatcacggc 


aaattttatc 


aaccgaaata 


caacagcaga 


agctaaagtg 


/ O U 


gaagagatca gtggtgtggc 


ttttatattc 


aaccaaaaat 


ttttccagga 


actgagggaa 


O A A 

o4 U 


gccacaggta ttgacttgga gaacatcgtt 


tactacaaag 


atgacacaca 


ctatttcgtt 


q n n 
yuu 


atgacagcca aaaagcagag 


tttgctggac 


aaaggagtga 


tactacatga 


ctacgccgac 


you 


acagagctcc tgctttcccg agaaaacgtg gaccaggagg ctctgctcag ctatgccagg 








ctgccgtctc 


fcggattttgc 


catcaatcac 




tatgggcagc ccgatgtggc 


catgtttgac 


ttcacttgta 


tgtatgcctc 


cgagaacgcc 


JL±4 U 


gccttggtgc gggagcagaa 


cggacaccag 


ttactagtgg 


ctctggtcgg 


ggacagcctc 


i o n n 


ctagagcctt tctggccaat 


gggaacagga 


atagcccggg 


gctttctagc 


tgctatggac 


t o cz. n 


tctgcctgga tggtccgaag 


ttggtctcta 


ggaacgagcc 


ctttggaagt 


gctggcagag 


*i o o n 


agggaaagta tttacaggtt 


gctgcctcag 


accacccctg 


agaatgtgag 


taagaacttc 


J_ j5 o U 


agccagtaca gtatcgaccc 


tgtcactcgg 


tatcccaata 


tcaacgtcaa 


cttcctccgg 


±44 U 


ccaagccagg tgcgccattt 


atatgatact 


ggcgaaacaa 


aagatattca 


cctggaaatg 


1 c n n 
JLb UU 


gagagcctgg tgaattcccg 


aaccaccccc 


aaattgactc 


gcaatgagtc 


tgtagctcgt 


iDD U 


tcaagcaaac tgctgggttg gtgccagagg cagacagatg gctatgcagg ggtaaacgtg 


i c: o r\ 


acagatctca ccatgtcctg 


gaaaagtggc 


ttggcccttt 


gtgcaattat 


ccatagatac 


T c q n 


cgccctgacc tgatagattt 


tgattctttg 


gatgagcaaa 


atgtggagaa 


gaataaccaa 


X /4 U 


ctggcctttg acattgctga 


gaaggaattg 


ggcatttctc 


ccatcatgac 


aggcaaagaa 


i o n n 


atggcctccg tgggggagcc 


tgataagctg 


tccatggtga 


tgtacctgac 


tcagttctac 


J.ODU 


gagatgttta aggactccct 


cccctctagc 


gacaccttgg 


acctaaatgc 


cgaggagaaa 


noon 


gcagtcctga tagccagcac 


cagatcccct 


atctccttcc 


taagcaaact 


tggccagacc 


n Q Q n 


atctctcgga agcgttctcc 


caaggataaa 


aaggaaaagg 


acttggatgg 


tgctgggaag 




aggagaaaga ccagtcaatc 


agaggaggag 


gaagctcctc 


ggggccacag 


aggagaaaga 


ziuu 


ccgaccctgg tgagcactct 


gacagacagg 


aggatggacg 


ttgccgttgg 


gaaccagaac 




aaagtgaagt acatggcgac 


ccagctgctg gccaaatttg 


aagagaatgc 


gcccgcacag 


noon 


tccatcggca tacggagaca 


gctgacacaa 


gagcgtgggg 


ccagccagcc 


gtcctgctgc 


noon 
z> A O U 


ctgcctgggc aggttcgccc 


tgcccccacc 


ccccggtgga 


aacagggctc 


catgaagaag 


j *± W 


gagttcccgc agaacctggg 


aggcagcgac 


acatgctact 


tctgccagaa 


gcgggtctac 


2400 


gtgatggaga ggctgagtgc 


cgagggcaag 


ttcttccacc 


ggagctgctt 


caagtgcgag 


2460 


tactgcgcca ccaccctgcg 


cctctcggcc 


tacgcctacg 


acatcgagga 


tggtaaattc 


2520 


tactgtaagc cacactactg ctatcgactc 


tctggctacg 


cacaaaggaa 


gagaccggca 


2580 
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gtggctcccc tgtctggaaa ggaggccaaa ggacccctgc aggatggcgc caccacagat 2 640 
gcaaacggac gggccaacgc cgtggccagc tccactgaga gaaccccagg ttcaggcgtg 2 70 0 
aacggcctgg aggagcccag catcgccaag cgactgaggg gcaccccaga gcggatcgag 2760 
ctggagaact accgcctgtc cctgaggcag gctgaggcac tgcaggaggt accggaggag 2820 
actcaggccg agcacaacct gagcagcgtg ctggacacgg gcgccgagga ggacgtcgcc 2880 

agcaggtcag cacgcagggc tgcagggcgc ccacccgcca cacggcccga agagtccagt 2940 

gaagccggga accagaggct ccagcaggtc atgcacgcgg cggatcctct ggagatccag 3000 

gctgacgtgc actggactca tatccgtgag agagaggagg aagagaggat ggcgccggcc 3 0 60 

tctgagtcct ctgcttccgg agccccattg gatgagaatg acctagagga agatgtggac 312 0 

tcagaaccag ccgagataga aggggaggca gcagaggatg gggacccagg ggacactggt 3180 

gctgagctgg atgatgatca gcactggtct gacagcccgt cggatgctga cagagagctg 3240 

cgtttgccgt gcccagctga gggggaagca gagctggagc tgagggtgtc ggaagatgag 3300 

gagaagctgc ccgcctcacc gaagcaccaa gagagaggtc cctcccaagc caccagcccc 3360 

atccggtctc cccaggaatc agctcttctg ttcattccag tccacagccc ctcaacagag 3420 

gggccccaac tcccacctgt ccctgccgcc acccaggaga aatcacctga ggagcgcctt 3480 

ttccctgagc ctttgctccc caaagagaag cccaaagctg atgccccctc ggatctgaaa 3 540 

gctgtgcact ctcccatccg atcacagcca gtgaccctgc cagaagctag gactcctgtc 3 60 0 

tcaccaggga gcccgcagcc ccggccaccc gtggcggcct ccacgccccc acccagccca 3 660 

ctccccatct gctcccagcc ccagccttcc accgaggcca ctgtcccatc ccctacccag 3720 

tcccccatac gcttccagcc tgccccggcc aaaacatcca ccccactggc ccctctccct 3780 

gtccaaagcc aaagtgacac caaggacaga ctgggcagcc cccttgctgt ggatgaggcc 3840 

ctcagacgga gcgacctggt ggaggagttc tggatgaaga gtgcggagat ccgccgcagc 3 900 

ctcgggctca cacctgtgga ccgcagcaag gggcccgagc ccagcttccc cacgcctgcc 3 960 

ttcaggccag tgtccctcaa atcctattcc gttgaaaagt ccccccagga tgagggactc 4 020 

caccttctca agcctctgtc catccccaaa aggctgggcc tgccaaagcc ggaaggcgag 4080 

ccgttgtccc tgccaacccc ccggtccccg tccgacagag agctacgcag cgcccaggag 4140 

gagcgcaggg agctgtccag cagctctggc ctgggcctgc acgggagctc ctccaacatg 42 00 

aagacactgg gcagccagag cttcaacacc tcggactccg ccatgctcac gcccccctcc 42 60 

agcccgcccc caccgccacc cccgggcgag gagcccgcca ccttgcggag gaagctcagg 4320 

gaggccgagc ccaatgcctc ggtggtcccg ccgcccttgc ccgccacctg gatgcggccc 438 0 
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ccccgggagc 


ctgctcagcc 


ccccagagag 


gaggtgcgga 


agtcgtttgt 


ggagagtgtg 


4440 


gaggagattc 


cctttgctga 


tgatgtggag 


gacacctatg 


acgacaagac 


tgaggactca 


4500 


agcctgcagg 


agaaattctt 


cacgcccccg 


tcctgctggc 


cgcgccccga 


gaagcctcgc 


4560 


cacccgcccc 


tggccaagga 


gaacgggagg 


ctgcctgctc 


tggaggggac 


gctgcagcca 


4620 


cagaagaggg ggctgccctt 


ggtgtccgcg 


gaagccaagg 


agttggccga 


ggagcgcatg 


4680 


cgagccaggg 


agaagtccgt 


gaagagccag 


gcgctgcggg 


atgccatggc 


caggcagctg 


4740 


agcaggatgc 


agcagatgga 


gctggcctca 


ggcgccccca 


ggccccgcaa 


ggcgtcctca 


4800 


gcaccctccc 


agggcaagga 


gcgccggcct 


gactccccca 


cacgccccac 


tctcaggggc 


4860 


tccgaggagc 


ccaccctgaa 


gcatgaagcc 


accagcgagg 


aggtcctctc 


cccgccgtcg 


4920 


gactcagggg gcccagatgg 


ctctttcact 


tcatccgagg gctccagtgg 


gaagagcaag 


4980 


aagaggtcgt 


cactcttctc 


cccccgcaga 


aacaagaagg 


aaaaaaaaf c 


caaaggegag 


5040 


ggccggcccc 


cggagaagcc 


cagctccaac 


ctcctaaaaa 




caaacccaag 


5100 


tccctgtgga 


agtccgtctt 


ctccgggtac 


a acra a crcr a c 3 




ggccgacgac 


5160 


aagtcctgcc 


ccagcacccc 


ctccagcggg 


cr ccac crater cr 


artct nap 3 3 


gcacagggtg 


5220 


cttcccgtcg 


taagggcaga 


gctgcagctc 


cggcgccagc 


tgagct tct c 


cgaggactca 


5280 


gacctctcca 


gcgacgatgt 


ccttgagaag 


tcctcacaga 


agtcccggcg 


agagecaaga 


5340 


acctacacgg 


aggaggaact 


gaatgccaag 


ctgacccggc 


gtgtgcaaaa 


ggcagctegg 


5400 


agacaggcca 


agcaggagga 


gcttaagcgg 


ctgeatcgag 


cccagatcat 


ccagcggcag 


5460 


ctgcagcagg 


tggaggagag 


gcagcggcgg 


ctggaggaaa 


ggggcgtggc 


tgtggagaag 


5520 


gcgctccggg gcgaagcagg 


catgggcaag 


aaggacgacc 


ccaagctgat 


gcaggagtgg 


5580 


ttcaagctag tgcaggagaa 


gaacgccatg 


gtgcgctacg 


agteggaget 


gatgatcttt 


5640 


gcccgggagc 


tggagctgga 


agaccggcag 


agtcgactgc 


agcaggagct 


ccgggaacgc 


5700 


atggcagtgg 


aagatcacct 


taagactgag 


gaggagctgt 


cagaagagaa 


gcagattctc 


5760 


aatgagatgc 


tggaggtggt 


ggagcagaga 


gactcactgg 


tggegctget 


ggaggagcag 


5820 


cggctccggg agagagagga 


ggacaaggac 


ctggaggctg 


ccatgctgtc 


caagggcttc 


5880 


agccttaact 


ggtcctgagc 


tcccacccaa 


cgctccattt 


tctgttggca 


tctgcctggc 


5940 


caggcagtgg 


catccaaacc 


acccggagcc 


gcgatctgag 


gaggectgge 


acctccttgg 


6000 


agtttacgct 


cagatgcccg 


tgtgctgctt 


ggaaagtggt 


cgagtcccgc 


gtgcagtggg 


6060 


gagccccagg 


tgacagtggt 


tatctgagac 


ggctccacct 


cctgggagga 


ggcccacctg 


6120 


gacctcccac 


tcagaggagg 


agcacggcgt 


gtatggcatg 


aegcagggga 


ccaccccgcg 


6180 


cgctccctga ggatgtgctg 


gctgtgcccc 


ttttttccac 


tggcacattt 


ggtaagagag 


6240 
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ggaagctgct ccccgtcaga accacagtgc gccgtgcgag gggcactgtc ttcttcatgc 63 00 

tccctggagc accaccaaag aaacgtaaac aataccccac gaaagcaggg tcaggggtca 6360 

gggtgcgatc gagacccagg atgggggcgt ccagtcatgc ccaccccagc atcacaggag 6420 

acatggaggt gcgggcaggc tcctgaatta ttatgcaaat taggaggacg caggaggggt 64 80 

ctgccctcca gccgaacacc acacactgga ccctaagtgg ccaaatgcct gggccgcttg 654 0 

ctggctgtgg cctgaggctt gtgggttgct gcattttgct tgtagttcac aaccattttg 6600 

acactggaaa atgctgactt tgggggacag gatgaggccc tacattctaa gcccccagtt 6660 

ggcagacagg cattgtccct gttccacatt tatgtcggga caggagatga ccttttcctc 672 0 

cgtgtttttc ctgtgtttgc acgttgaaat gaagctgaca acctggcaag acgctcagcc 6780 

gcttcaaacc ctttttgtca attaacttat tttttaatac ttgaaaagaa gtaacttcgt 6840 

ttgtgtatct ttactagagg aactgatcac ctgcgcccgg gtgcgggagc cacagcggca 6900 

tctggtgcgt cctacgcgac ctggtccggg gctgcccggt gctcctcacg tgcatctatt 6960 

tattagcctt tctcttcgta tcactggcct ggctggcatc agggagctgc ccagaacccc 7020 

tgtgtgggtc ctcctcacag ctttctgtcc cctcctccac ccggtgcctg cctcaccctg 7080 

gcgctagacc atctggacca ctcatgtgat gagggtgcat ttccgttctg ttttgggcca 7140 

ggccaacagc agagctgcca ctctcaccct cccagtgaga attccggctc tgcagaactc 7200 

gcccttgtct cagtttgggg gccagggcat caccttcctc cgcacatatg ttaaagaagg 7260 

tttcaggatg ggccctcatc cacacaggcc aagagggtgc aaggtgggac cctggaatca 732 0 

tgtggctggt gagaattccc tcctcccagc ctaagattca cccagacgga aacgcgagtg 7380 

ctgcagtgga tgctgggatg caggctggtg cctctcaagc agatcagcga ctcccccctt 744 0 

ccttccgaag gtgacgggca cctgccttgt gccggatctt catggggaca taaagggcga 7500 

gccccgaatc actagctcct atagccaaac tgttcctttc tcacggttcc gcaccagcct 7560 

ggctgtgtac agctcatcaa gccactgagc taatcggggt ggggtgcttg tccatcaaag 7620 

cagtcagcac atagcccagc gaggagccat ccggaccaga cccgcctgcc aggggcgctc 7680 

cagctgcctg ccgcctccgt gggctggggc cagctgggag cagaggcctg gcccctccaa 7740 

ggcgtcccgc agtggagcag gctgagtggc tgtgctgacc ttgggctttc catgggaacc 78 00 

acactgtgct tcaacttgaa cattcatccc agctgcaaag gagcaaagaa cctgtgtcat 7860 

ccttgtctgt ccagaagctg ccatctctct cccatgcaca tccgggagat aacgccgcta 792 0 

gtggccgcca cagcgctgat tctccacctg ttctcacttg aggcaaaggt ctccttttct 7980 

tttctcttta cttaaaaaag gggagaggag ggtttctaga ttccatcttc aaaccccagt 804 0 
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cgtcccataa 
tctcctcgaa 
tcctgctggc 
gggatctgct 
tcagccttaa 
gacccgagcc 
cccgactgtg 
ccactagttt 
tgtgagaatg 
ggcagggaag 
gcccagactc 
ccctggaggg 
gcttccaggc 
cgattttgac 
acagaaagga 
atgggtctgt 



aattggacgt 

tgccacaaaa 

cccacaacag 

gtaagtggcc 

cactgtggca 

tttgcagaga 

aggacaccca 

gcggttgact 

tgcgcagctc 

caggatggat 

aaggctgact 

caggccggag 

ctgcccagat 

cagcaatgat 

gagttaaaaa 

tttatgcaaa 



gggaaaagac 

ccggtccagc 

ggaaacagtt 

agacgtagca 

tcctccacag 

ggcgccgcag 

gttctagtag 

tgagtactct 

caggcaggtt 

acacatatat 

aaactggagg 

gcgctcccgc 

gatgccttca 

agactcttct 

acattttttt 

ataaaagttt 
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cttcactgcc 

cccggcagag 

tgcaaagtgg 

ggagagccca 

cacacatggc 

agccgcaggc 

agcacttttt 

ggtgacttcc 

ttctctcgga 

cacacacata 

ctgaataccg 

ccctgggctt 

ggcctctgtc 

taaccctttc 

aaaggtggta 

ttcaaat 



tgcgtggcct 

ccgccttcgg 

cttggaagga 

gtgtcacctt 

agcctgcagt 

ctgcgcccca 

ttaaagctcc 

tgcgtcaagc 

gagttaagtc 

aaacaccagg 

tggaggtcca 

gaggatgctg 

cctggcggcc 

aaaataaatt 

acatctgacc 



ttcccagacc 

cccttgtagc 

gtgtggccta 

ctggtcctgg 

ctggaaggtg 

gccttctgct 

cattttgtaa 

gttctcaagc 

ttcccttgaa 

tgcgggagca 

catgcagctt 

caccccgtgg 

atcctcaggc 

tttcagtggg 

cacaaaggga 



8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

8977 



<210> 6 

<211> 1965 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Glu Glu Arg Lys His Glu Thr Met Asn Pro Ala His Val Leu Phe 
1 5 10 15 



Asp Arg Phe Val Gin Ala Thr Thr Cys Lys Gly Thr Leu Lys Ala Phe 
20 25 30 

Gin Glu Leu Cys Asp His Leu Glu Leu Lys Pro Lys Asp Tyr Arg Ser 
35 40 45 



Phe Tyr His Lys Leu Lys Ser Lys Leu Asn Tyr Trp Lys Ala Lys Ale 
50 55 



60 



Leu Trp Ala Lys Leu Asp Lys Arg Gly Ser His Lys Asp Tyr Lys Lys 
65 70 75 ~ L ^ so 

Gly Lys Ala Cys Thr Asn Thr Lys Cys Leu He He Gly Ala Gly Pro 
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85 90 95 

Cys Gly Leu Arg Thr Ala lie Asp Leu Ser Leu Leu Gly Ala Lys Val 
100 105 110 

Val Val lie Glu Lys Arg Asp Ala Phe Ser Arg Asn Asn Val Leu His 
115 120 125 

Leu Trp Pro Phe Thr He His Asp Leu Arg Gly Leu Gly Ala Lys Lys 
130 135 140 

Phe Tyr Gly Lys Phe Cys Ala Gly Ala He Asp His He Ser He Arq 
145 150 155 i 6 o 

Gin Leu Gin Leu He Leu Leu Lys Val Ala Leu He Leu Gly He Glu 
165 170 175 

He His Val Asn Val Glu Phe Gin Gly Leu He Gin Pro Pro Glu Asp 
180 185 190 

Gin Glu Asn Glu Arg He Gly Trp Arg Ala Leu Val His Pro Lys Thr 
195 200 205 

His Pro Val Ser Glu Tyr Glu Phe Glu Val He He Gly Gly Asp Gly 
210 215 220 

Arg Arg Asn Thr Leu Glu Gly Phe Arg Arg Lys Glu Phe Arg Gly Lys 
225 230 235 ~ 240 

Leu Ala He Ala He Thr Ala Asn Phe He Asn Arg Asn Thr Thr Ala 
245 250 255 

Glu Ala Lys Val Glu Glu He Ser Gly Val Ala Phe He Phe Asn Gin 
260 265 270 

Lys Phe Phe Gin Glu Leu Arg Glu Ala Thr Gly He Asp Leu Glu Asn 
275 280 285 

He Val Tyr Tyr Lys Asp Asp Thr His Tyr Phe Val Met Thr Ala Lys 
290 295 300 

Lys Gin Ser Leu Leu Asp Lys Gly Val He Leu His Asp Tyr Ala Asp 
305 310 315 " ' 320 

Thr Glu Leu Leu Leu Ser Arg Glu Asn Val Asp Gin Glu Ala Leu Leu 
325 330 335 
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Ser Tyr Ala Arg Glu Ala Ala Asp Phe Ser Thr Gin Gin Gin Leu Pro 
340 345 350 

Ser Leu Asp Phe Ala He Asn His Tyr Gly Gin Pro Asp Val Ala Met 
355 360 365 

Phe Asp Phe Thr Cys Met Tyr Ala Ser Glu Asn Ala Ala Leu Val Arq 
370 375 380 

Glu Gin Asn Gly His Gin Leu Leu Val Ala Leu Val Gly Asp Ser Leu 
385 390 395 ~ 400 

Leu Glu Pro Phe Trp Pro Met Gly Thr Gly He Ala Arg Gly Phe Leu 
405 410 415 

Ala Ala Met Asp Ser Ala Trp Met Val Arg Ser Trp Ser Leu Gly Thr 
420 425 ~ 430 

Ser Pro Leu Glu Val Leu Ala Glu Arg Glu Ser He Tyr Arg Leu Leu 
435 440 445 

Pro Gin Thr Thr Pro Glu Asn Val Ser Lys Asn Phe Ser Gin Tyr Ser 
450 455 460 

He Asp Pro Val Thr Arg Tyr Pro Asn He Asn Val Asn Phe Leu Arg 
465 470 475 480 

Pro Ser Gin Val Arg His Leu Tyr Asp Thr Gly Glu Thr Lys Asp He 
485 490 495 

His Leu Glu Met Glu Ser Leu Val Asn Ser Arg Thr Thr Pro Lys Leu 
500 505 510 

Thr Arg Asn Glu Ser Val Ala Arg Ser Ser Lys Leu Leu Gly Trp Cys 
515 520 525 

Gin Arg Gin Thr Asp Gly Tyr Ala Gly Val Asn Val Thr Asp Leu Thr 
530 535 540 

Met Ser Trp Lys Ser Gly Leu Ala Leu Cys Ala He He His Arq Tyr 
545 550 555 560 

Arg Pro Asp Leu He Asp Phe Asp Ser Leu Asp Glu Gin Asn Val Glu 
565 570 575 
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Lys Asn Asn Gin Leu Ala Phe Asp lie Ala Glu Lys Glu Leu Gly lie 
580 585 590 



Ser Pro lie Met Thr Gly Lys Glu Met Ala Ser Val Gly Glu Pro Asp 
595 600 605 



Lys Leu Ser Met Val Met Tyr Leu Thr Gin Phe Tyr Glu Met Phe Lys 
610 615 620 



Asp Ser Leu Pro Ser Ser Asp Thr Leu Asp Leu Asn Ala Glu Glu Lys 
625 630 635 640 



Ala Val Leu lie Ala Ser Thr Arg Ser Pro He Ser Phe Leu Ser Lys 
645 650 655 



Leu Gly Gin Thr He Ser Arg Lys Arg Ser Pro Lys Asp Lys Lys Glu 
660 665 670 



Lys Asp Leu Asp Gly Ala Gly Lys Arg Arg Lys Thr Ser Gin Ser Glu 
675 680 685 



Glu Glu Glu Ala Pro Arg Gly His Arg Gly Glu Arg Pro Thr Leu Val 
690 695 700 



Ser Thr Leu Thr Asp Arg Arg Met Asp Val Ala Val Gly Asn Gin Asn 
705 710 715 720 



Lys Val Lys Tyr Met Ala Thr Gin Leu Leu Ala Lys Phe Glu Glu Asn 
725 730 735 



Ala Pro Ala Gin Ser He Gly He Arg Arg Gin Leu Thr Gin Glu Arg 
740 745 750 



Gly Ala Ser Gin Pro Ser Cys Cys Leu Pro Gly Gin Val Arg Pro Ala 
755 760 765 



Pro Thr Pro Arg Trp Lys Gin Gly Ser Met Lys Lys Glu Phe Pro Gin 
770 775 780 



Asn Leu Gly Gly Ser Asp Thr Cys Tyr Phe Cys Gin Lys Arg Val Tyr 
785 790 795 " 800 



Val Met Glu Arg Leu Ser Ala Glu Gly Lys Phe Phe His Arg Ser Cys 
805 810 815 
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Phe Lys Cys Glu Tyr Cys Ala Thr Thr Leu Arg Leu Ser Ala Tyr Ala 
820 825 830 

Tyr Asp lie Glu Asp Gly Lys Phe Tyr Cys Lys Pro His Tyr Cys Tyr 
835 840 845 

Arg Leu Ser Gly Tyr Ala Gin Arg Lys Arg Pro Ala Val Ala Pro Leu 
850 855 860 

Ser Gly Lys Glu Ala Lys Gly Pro Leu Gin Asp Gly Ala Thr Thr Asp 
865 870 875 " 880 

Ala Asn Gly Arg Ala Asn Ala Val Ala Ser Ser Thr Glu Arg Thr Pro 
885 890 895 

Gly Ser Gly Val Asn Gly Leu Glu Glu Pro Ser He Ala Lys Arg Leu 
900 905 910 

Arg Gly Thr Pro Glu Arg He Glu Leu Glu Asn Tyr Arg Leu Ser Leu 
915 920 925 

Arg Gin Ala Glu Ala Leu Gin Glu Val Pro Glu Glu Thr Gin Ala Glu 
930 935 940 

His Asn Leu Ser Ser Val Leu Asp Thr Gly Ala Glu Glu Asp Val Ala 
945 950 955 L 960 

Ser Arg Ser Ala Arg Arg Ala Ala Gly Arg Pro Pro Ala Thr Arg Pro 
965 970 975 

Glu Glu Ser Ser Glu Ala Gly Asn Gin Arg Leu Gin Gin Val Met His 
980 985 990 

Ala Ala Asp Pro Leu Glu He Gin Ala Asp Val His Trp Thr His He 
995 1000 1005 

Arg Glu Arg Glu Glu Glu Glu Arg Met Ala Pro Ala Ser Glu Ser 
1010 1015 1020 

Ser Ala Ser Gly Ala Pro Leu Asp Glu Asn Asp Leu Glu Glu Asp 
1025 1030 1035 

Val Asp Ser Glu Pro Ala Glu He Glu Gly Glu Ala Ala Glu Asp 
1040 1045 1050 



Gly Asp 



Pro Gly Asp Thr Gly 



Ala Glu Leu Asp Asp 



Asp Gin His 
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1055 1060 1065 



Trp Ser Asp Ser Pro Ser Asp Ala Asp Arg Glu Leu Arg Leu Pro 
1070 1075 1080 



Cys Pro Ala Glu Gly Glu Ala Glu Leu Glu Leu Arg Val Ser Glu 
1085 1090 1095 



Asp Glu Glu Lys Leu Pro Ala Ser Pro Lys His Gin Glu Arg Gly 
1100 1105 1110 



Pro Ser Gin Ala Thr Ser Pro lie Arg Ser Pro Gin Glu Ser Ala 
1115 1120 1125 



Leu Leu Phe lie Pro Val His Ser Pro Ser Thr Glu Gly Pro Gin 
1130 1135 1140 



Leu Pro Pro Val Pro Ala Ala Thr Gin Glu Lys Ser Pro Glu Glu 
1145 1150 1155 



Arg Leu Phe Pro Glu Pro Leu Leu Pro Lys Glu Lys Pro Lys Ala 
1160 1165 1170 



Asp Ala Pro Ser Asp Leu Lys Ala Val His Ser Pro lie Arg Ser 
1175 1180 1185 



Gin Pro Val Thr Leu Pro Glu Ala Arg Thr Pro Val Ser Pro Gly 
1190 1195 1200 



Ser Pro Gin Pro Arg Pro Pro Val Ala Ala Ser Thr Pro Pro Pro 
1205 1210 1215 



Ser Pro Leu Pro lie Cys Ser Gin Pro Gin Pro Ser Thr Glu Ala 
1220 1225 1230 



Thr Val Pro Ser Pro Thr Gin Ser Pro lie Arg Phe Gin Pro Ala 
1235 1240 1245 



Pro Ala Lys Thr Ser Thr Pro Leu Ala Pro Leu Pro Val Gin Ser 
1250 1255 1260 



Gin Ser Asp Thr Lys Asp Arg Leu Gly Ser Pro Leu Ala Val Asp 
1265 1270 1275 



Glu Ala Leu Arg Arg Ser Asp Leu Val Glu Glu Phe Trp Met Lys 
1280 1285 1290 
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Ser Ala Glu lie Arg Arg Ser Leu Gly Leu Thr Pro Val Asp Arg 
1295 1300 1305 



Ser Lys Gly Pro Glu Pro Ser Phe Pro Thr Pro Ala Phe Arg Pro 
1310 1315 1320 



Val Ser Leu Lys Ser Tyr Ser Val Glu Lys Ser Pro Gin Asp Glu 
1325 1330 1335 



Gly Leu His Leu Leu Lys Pro Leu Ser lie Pro Lys Arg Leu Gly 
1340 1345 1350 



Leu Pro Lys Pro Glu Gly Glu Pro Leu Ser Leu Pro Thr Pro Arg 
1355 1360 1365 



Ser Pro Ser Asp Arg Glu Leu Arg Ser Ala Gin Glu Glu Arg Arg 
1370 1375 1380 



Glu Leu Ser Ser Ser Ser Gly Leu Gly Leu His Gly Ser Ser Ser 
1385 1390 1395 



Asn Met Lys Thr Leu Gly Ser Gin Ser Phe Asn Thr Ser Asp Ser 
1400 1405 1410 



Ala Met Leu Thr Pro Pro Ser Ser Pro Pro Pro Pro Pro Pro Pro 
1415 1420 1425 



Gly Glu Glu Pro Ala Thr Leu Arg Arg Lys Leu Arg Glu Ala Glu 
1430 1435 1440 



Pro Asn Ala Ser Val Val Pro Pro Pro Leu Pro Ala Thr Trp Met 
1445 1450 1455 



Arg Pro Pro Arg Glu Pro Ala Gin Pro Pro Arg Glu Glu Val Arg 
1460 1465 1470 



Lys Ser Phe Val Glu Ser Val Glu Glu lie Pro Phe Ala Asp Asp 
1475 1480 1485 



Val Glu Asp Thr Tyr Asp Asp Lys Thr Glu Asp Ser Ser Leu Gin 
1490 1495 1500 



Glu Lys Phe Phe Thr Pro Pro Ser Cys Trp Pro Arg Pro Glu Lys 
1505 1510 1515 
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Pro Arg His Pro Pro Leu Ala Lys Glu Asn Gly Arg Leu Pro Ala 
1520 1525 1530 



Leu Glu Gly Thr Leu Gin Pro Gin Lys Arg Gly Leu Pro Leu Val 
1535 1540 1545 



Ser Ala Glu Ala Lys Glu Leu Ala Glu Glu Arg Met Arg Ala Arg 
1550 1555 1560 



Glu Lys Ser Val Lys Ser Gin Ala Leu Arg Asp Ala Met Ala Arg 
1565 1570 1575 



Gin Leu Ser Arg Met Gin Gin Met Glu Leu Ala Ser Gly Ala Pro 
1580 1585 1590 



Arg Pro Arg Lys Ala Ser Ser Ala Pro Ser Gin Gly Lys Glu Arg 
1595 1600 1605 



Arg Pro Asp Ser Pro Thr Arg Pro Thr Leu Arg Gly Ser Glu Glu 
1610 1615 1620 



Pro Thr Leu Lys His Glu Ala Thr Ser Glu Glu Val Leu Ser Pro 
1625 1630 1635 



Pro Ser Asp Ser Gly Gly Pro Asp Gly Ser Phe Thr Ser Ser Glu 
1640 1645 1650 



Gly Ser Ser Gly Lys Ser Lys Lys Arg Ser Ser Leu Phe Ser Pro 
1655 1660 1665 



Arg Arg Asn Lys Lys Glu Lys Lys Ser Lys Gly Glu Gly Arg Pro 
1670 1675 1680 



Pro Glu Lys Pro Ser Ser Asn Leu Leu Glu Glu Ala Ala Ala Lys 
1685 1690 1695 



Pro Lys Ser Leu Trp Lys Ser Val Phe Ser Gly Tyr Lys Lys Asp 
1700 1705 1710 



Lys Lys Lys Lys Ala Asp Asp Lys Ser Cys Pro Ser Thr Pro Ser 
1715 1720 1725 



Ser Gly Ala Thr Val Asp Ser Gly Lys His Arg Val Leu Pro Val 
1730 1735 1740 
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Val Arg Ala Glu Leu Gin Leu Arg Arg Gin Leu Ser Phe Ser Glu 
• 1745 1750 1755 



Asp Ser Asp Leu Ser Ser Asp Asp Val Leu Glu Lys Ser Ser Gin 
1760 1765 1770 



Lys Ser Arg Arg Glu Pro Arg Thr Tyr Thr Glu Glu Glu Leu Asn 
1775 1780 1785 



Ala Lys Leu Thr Arg Arg Val Gin Lys Ala Ala Arg Arg Gin Ala 
1790 1795 1800 



Lys Gin Glu Glu Leu Lys Arg Leu His Arg Ala Gin lie lie Gin 
1805 1810 1815 



Arg Gin Leu Gin Gin Val Glu Glu Arg Gin Arg Arg Leu Glu Glu 
1820 1825 1830 



Arg Gly Val Ala Val Glu Lys Ala Leu Arg Gly Glu Ala Gly Met 
1835 1840 1845 



Gly Lys Lys Asp Asp Pro Lys Leu Met Gin Glu Trp Phe Lys Leu 
1850 1855 1860 



Val Gin Glu Lys Asn Ala Met Val Arg Tyr Glu Ser Glu Leu Met 
1865 1870 1875 



lie Phe Ala Arg Glu Leu Glu Leu Glu Asp Arg Gin Ser Arg Leu 
1880 1885 1890 



Gin Gin Glu Leu Arg Glu Arg Met Ala Val Glu Asp His Leu Lys 
1895 1900 1905 



Thr Glu Glu Glu Leu Ser Glu Glu Lys Gin lie Leu Asn Glu Met 
1910 1915 1920 



Leu Glu Val Val Glu Gin Arg Asp Ser Leu Val Ala Leu Leu Glu 
1925 1930 1935 



Glu Gin Arg Leu Arg Glu Arg Glu Glu Asp Lys Asp Leu Glu Ala 
1940 1945 1950 



Ala Met Leu Ser Lys Gly Phe Ser Leu Asn Trp Ser 
1955 1960 1965 



<210> 7 
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<211> 14172 

<212> DNA 

<213> Drosophila 



<400> 7 



atgagccgcc 


aacaccagcg 


gcaccaccag 


cagcatcacc 


acctgccgcc 


gcaccagcaa 


60 


ccgcagcagc 


agatgccgca 


acaacagcag 


cagctgacgg 


cgcagcagca 


gcaacaacag 


120 


cagctgctga 


tggcggagca 


cgcggcggcc 


gcggaggcgg 


cggagctatt 


cgacctgctg 


180 


tgcgtggcca 


caacgatgcg 


ccagatcctg 


gcgctccatc 


gggccatgtg 


cgaggctgtg 


240 


ggattgagac 


cctcgcctct 


gaacgacttc 


tacccacggc 


taaaggccaa ggtgcgttcg 


300 


tggaaggcgc 


aggccctgtg 


gaagaagttc 


gacgccagag 


ctgcccatag agtctacggc 


360 


aagggagctg 


cctgtactgg 


cacacgcgtc 


ctggtcatcg 


gagcagggcc 


ctgtggactg 


420 


cgcaccgcca 


tcgaggccca 


actgctgggc 


gccaaggtgg 


tggtgctgga 


gaaacgcgat 


480 


cgcatcaccc 


ggaacaatgt 


gctccatctg 


tggccattcg 


tcatcacgga 


tctgcgcaac 


540 


ttgggcgcaa 


agaagttcta 


cggcaagttt 


tgcgccggct 


ccatcgatca 


catctccatt 


600 


cggcagctgc 


agtgcatgct 


gctcaaggtg 


gcgctgctcc 


tgggcgtaga 


gatccacgag 


660 


ggagtcagtt 


ttgatcacgc 


tgtagagccc 


tctggcgatg 


gcggcggatg gagggcagct 


720 


gttactcccg 


cagatcatcc 


tgtatctcac 


tacgaattcg 


atgtgttgat 


cggagcggat 


780 


ggcaagcgga 


atatgctgga 


ctttaggagg 


aaggagttcc 


gcgggaagct 


ggccatcgct 


840 


attacagcga 


actttatcaa 


caagaagacg 


gaggcggagg 


ctaaagtaga ggagatcagt 


900 


ggggtggctt 


tcatcttcaa 


ccaggccttc 


ttcaaggagc 


tgtacgggaa gacgggcatc 


960 


gacctggaaa 


acatcgtcta 


ctacaaggac 


gagacgcact 


acttcgtgat 


gacggccaag 


1020 


aagcacagtc 


taattgacaa 


gggcgttatt 


atcgaggata 


tggccgatcc 


cggcgagctt 


1080 


ctcgccccag 


ccaatgtgga 


tacacaaaag 


ctgcacgact 


atgcacgcga 


ggctgcggag 


1140 


ttctccaccc 


aataccaaat 


gccaaacctg 


gagttcgctg 


ttaatcacta 


cggcaaacca 


1200 


gatgtggcca 


tgttcgactt 


cacatcgatg 


tttgccgccg 


agatgtcctg tcgggtgatt 


1260 


gtgcgcaaag 


gagctcgcct 


gatgcagtgc 


ctcgtgggtg 


acagtctgct 


cgagccgttt 


1320 


tggcccactg 


gatcgggttg 


tgcccgtgga 


ttcttatcca 


gcatggatgc 


tgcctatgcc 


1380 


atcaagcttt 


ggtccaaccc 


gcagaacagc 


acacttggcg 


ttctggcgca 


gcgcgaaagc 


1440 


atctaccggc 


tgcttaacca 


gaccacgccg 


gacaccctgc 


agcgggacat 


cagtgcctat 


1500 


accgtggatc 


cggccacgcg 


ctatccgaat 


ctgaacaggg 


agtcggtcaa 


tagctggcag 


1560 


gtcaaacatc 


tggtcgacac 


ggacgacccg 


tccattctgg 


agcagacctt 


catggacacg 


1620 


catgctctgc 


agaccccgca 


tttggacaca 


ccgggcagac 


gcaagcgacg 


cagtggagac 


1680 
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ttgctgcccc 


agggtgccac 


gttgctgaga 


tggataagtg 


cccagctgca 


ttcctatcag 


1740 


tttattcccg 


aactcaagga 


ggcttcggat 


gtgttccgga 


atggacgcgt 


tctgtgtgcg 


1800 


cttatcaatc 


gctatcgtcc 


tgatctcatc 


gactacgctg 


ccaccaagga 


catgagtccc 


1860 


gtggagtgca 


atgagctgtc 


attcgccgtc 


ctagagcgcg 


aactccacat 


cgatcgcgtc 


1920 


atgagtgcca 


aacagtcgct 


ggacttgacc 


gagctggagt 


cgcgaatctg 


gctcaactat 


1980 


ttggaccaga 


tctgcgactt 


gtttcgcggc 


gagatccccc 


atatcaagca 


ccccaagatg 


2040 


gactttagcg 


atttgcgcca 


gaagtatcgt 


atcaaccata 


cgcatgccca 


acccgacttc 


2100 


tccaagctgc 


tggcaacgaa 


acccaaggcc 


aagtcgccga 


tgcaggatgc 


tgtggacata 


2160 


cccacgacag 


tgcagcggcg 


ctcggtgctc 


gaggaggagc 


gagccaagcg 


gcagcgtcgc 


2220 


cacgagcagc 


ttcttaacat 


cggtggaggg 


gcagcaggag 


ccgccgccgg 


agttgccgga 


2280 


agcgggacag 


gaaccacaac 


gcagggtcaa 


aacgatacgc 


cacgccggtc 


caagaagcgc 


2340 


cgtcaggttg 


acaaaaccgc 


caatattgag 


gagcgccagc 


agcgcttgca 


ggagatcgag 


2400 


gagaatcggc 


aggagcggat 


gagcaagcgg 


cgccagcagc 


gctgtcacca 


gacgcagaat 


2460 


ttctacaaga 


gccttcagct 


cctgcaggcg 


ggcaagctct 


tgagggaggg 


tggtgaggcg 


2520 


ggagtggccg 


aggatggcac 


cccattcgag 


gactactcga 


tattcctcta 


ccgccagcag 


2580 


gcgcccgtat 


tcaatgatcg 


cgtcaaggac 


ctcgagcgaa 


agcttctgtt 


tcccgatcgc 


2640 


gaacggggag 


atattccgtc 


ggcattgccg 


cgcacggcgg 


acgagcagtt 


cagcgatcgg 


2700 


attaaaaaca 


tggagcagcg 


gatgacggga 


cgtggtggcc 


ttggtggtga 


caagaagccc 


2760 


aaggatctga 


tgcgggctat 


cggcaagatc 


gactcgaacg 


attggaatgt 


acgcgagatc 


2820 


gagaagaaga 


tcgagctgtc 


gaagaagacg 


gagatccacg 


ggcctaaggg 


ccgcgagaag 


2880 


gttcccaagt 


ggagcaagga 


gcagttccag 


gcgcgacagc 


acaagatgtc 


caagccgcaa 


2940 


cgccaggatt 


cgcgtgaggc 


ggaaaagttc 


aaggacatcg 


accagactat 


ccgcaacttg 


3000 


gacaagcagc 


taaaggaggg 


ccacaatctg 


gatgtgggcg 


agcggggacg 


caacaaagtg 


3060 


gcctccattg 


ccggtcagtt 


tggcaaaaag 


gatgaggcca 


attcggatga 


gaagaacgcg 


3120 


ggcagcagca 


atgccaccac 


caacaccaac 


aacacagtca 


tacccaaatc 


tagttccaag 


3180 


gtggcgctgg 


cctttaaaaa 


gcaggctgcc 


tccgaaaagt 


gccgcttctg 


taagcaaacc 


3240 


gtctacccga 


tggagaagac 


caccgtggag 


ggattggttc 


tgcatcgcaa 


ttgccttaag 


3300 


tgccaccact 


gccacaccaa 


cttgcgtctg 


ggaggctacg 


cctttgatcg 


ggacgatccg 


3360 


cagggccgac 


tttactgcac 


ccaacacttc 


cggttgccac 


ccaaaccgct 


gccgcagcgc 


3420 


accaacaaag 


ccaggaaatc 


cgctgccgct 


caacccgcct 


cgcctgctgt 


accaccaact 


3480 


gcgggatccg 


tacccactgc 


agctgccaca 


tcggagcata 


tggacaccac 


tccacccagg 


3540 
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gaccaggtgg atctactgca gacctcgcga gcaaatgcct 


ctgccgatgc 


catgtccgat 


3600 


gatgaagcca 


atgttatcga 


tgagcacgaa 


tggtctggtc 


gcaacttctt 


gcccgagtcc 


3660 


aacaacgatt 


cccaatcaga 


gctatccagt 


tcagatgaat 


cggatacgga 


atcggattcg 


3720 


gagatgtttg 


aggaggcgga 


tgattcgccg 


tttggtgctc 


agaccctcca 


gctggcgtcg 


3780 


gattggattg 


gaaagcaata 


ctgtgaggac 


agtgatgatt 


ctgacgattt 


ctacgactca 


3840 


agtgaagatg acggcaaaga tgacaccgag ggtgaggaat 


tcaagaaggc 


ccgcgaattg 


3900 


aggcgccagg 


aagttcgcct 


gcagccgttg 


cccgccaatc 


tgcccacaga 


tacggagacc 


3960 


gagaaactca 


agttaaacgt 


agacaataaa 


gaaaacatgg 


cggacggaag 


ctctctgaaa 


4020 


tcgggcaatt 


cctttgagtc 


cgcgcgcagc 


cagccgtcta 


cgcccctgtc 


cacgcccact 


4080 


cgtgtcgaga 


tggagcaact 


ggagcgagat 


gctccacgca 


agtttagcag 


cgaaatcgag 


4140 


gcaatcagcg 


agaaacttta 


ccacatgaac 


aacatggtaa 


agatgaacaa 


ggacctcgag 


4200 


gtgctggcca aggaaaatct ggtcaagagc ggcatcctgc 


gcaagctcac 


gctgaaggag 


4260 


aagtggctgg 


cggagaatgc 


ggccatagca 


gcaggccaaa 


aagtgactcc 


gactcctagt 


4320 


gcgactgctc 


ctgggcttca 


acccaagtcg 


aagttcgatg 


aaaagttcga 


aaaggtagtg 


4380 


agtccacctc 


agccggttgt 


cgaacccaaa 


cccaagcccg 


taatcgattt 


caatttggat 


4440 


gagttaaaac 


cacgtaaacc 


gaactttgag 


gagcgaccca 


aggagcagct 


gcccaggcct 


4500 


gaaagtttga 


aaaaacctcc 


acaacaaaaa 


ccgaagggca 


gcagcacaaa 


cgtgagccgc 


4560 


tccaacagct 


tgaagagtaa 


tgccagcaac 


gggagtccaa 


aggtgaaaaa 


agctcctgtt 


4620 


tcaaataaca 


gcaaaatgca 


aattgaagga 


atccttgata 


ccttgagaaa 


gattcagagc 


4680 


cagaacagta 


gcgatcagga 


tgaagatatg 


gatgtagatg 


aggatgtgga 


aagaaagcca 


4740 


aataaagagc 


tcaacagcaa 


gttgaaggaa 


atccaagcca 


gcagctttgc 


cggcacaatg 


4800 


gaccatatta 


aatcccagct 


gactatgcca 


acggttagtg 


cgcaggcacc 


accctccatg 


4860 


gatctttcca 


agtactttcc 


caaccagaag 


caagagaaga 


gctccaccag 


cagcacaaac 


4920 


aagaatcagg ttaccctgaa ggatgtaaat 


ctggccaagt 


acttccccag 


cagtccagct 


4980 


ccacaacgga 


gaaccgtgga 


aacggtggcc 


gatcgactaa 


agaagtcaca 


gactgaggct 


5040 


gctctagcta 


aaaccaagct 


actggaggat 


caggccaata 


accaggccga 


aaagaccaag 


5100 


aaagaggtcg 


aaaaggaagg 


agagtctaaa 


aaaatcacga 


aaaaggtggc 


agactccaaa 


5160 


gcagtacctc 


caaaacggca 


ggcctcgtta 


gacaccttta 


gcttaaggga gcaccaaatg 


5220 


gatggagcat 


tagacctgac 


caaaaagaag 


ggtcccacca 


aagcgagcgc 


tggtgttaaa 


5280 


aaaccagcca 


aatcagggag 


taccacatcc 


gtgacaaaag 


ccacggccac 


atccgaagga 


5340 
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aagaccatta agattgtaaa gaaaatcgta cccaaaggaa ccaaagctaa aaaggcagca 5400 

gaagcggccc aagaatccgc agtagtagag gctccgccag aaaagaaacc tccaaaagat 5460 

gaagcagagc ggatattgga cgaaattctg ggagatggtg agtatcgttc acccagctcc 5520 

gagtatcagc gcttgttcca ggacgaaaag tcgcctagcg atttgtccga taacatagat 5580 

cggatactcg aggaatccga gttggatgtg gaattgggtc tgccaaaacg cagcagtaag 5640 

aaactggtaa aaaccaaatc cctgggcgag ggtgactttg acatgaagcc ttcaaaagaa 5700 

agacttactg gagtacagaa catacttaag cgatttgagt caatgagttc agtcacctca 5760 

cagaacagtg atgaacaggc ggggttcaaa ttacgtcgca tggagtccac caccagtaat 5820 

ctgagcagct tgacccgttc cagggaatca cttgtctccg tcagcgattc aatgagtgat 588 0 

ttggagaaaa ccatggacta cctccgtaat gaatggcgca atgaggccac caattttctg 5 94 0 

caaaagaagc gtgataaatt ctatgccaaa aaggaggagc aagaaaagga ggctaaaatt 6000 

ctggctaaac ctgatccctt agacaatctt cctgttcaat atcgcgactc caagttggcc 6060 

aagttctttg gcttggccgc tagcaagtca ccagaaaacc gaaagtcccc tattaagaag 612 0 

aagaaatctc cctctaaaac gccaaaggtg accaaagcaa acaattcact ggaggagctg 618 0 

gccaagatta gtaatgttcg ccagaccaag aaggcacagc cgaagactct aaagcctgta 624 0 

gaagtaaaac ctttaaagcc agccagtcca gttccagatg attttgagat tcttgatctg 6300 

ctggaaaaag ccacggaggc caaagaactg gaacgctcga agaccaagag tccagccgta 6360 

gaatctatca gtcagacgcc caaagaagct atagtagaaa tttccttacc agtggaagac 642 0 

attaaaaatc tacccaaaac cgggtgtgat aagtcctcaa acagctctcg ccgtggttcc 648 0 

caatctagtt tgatcatgtc gcgacgacac tcggagattt ctctaaacga gaaactcaac 654 0 

caggacgcac tcgcagcctt gaaccaaata gaaaaagaaa gagaggcgga gcaagtggac 6600 

gaacttttcc agagcatggt ggaggaaatg gagcaggaac cgcagcccac agctattgtc 6660 

gaacctccag aagaagacat cgatgcagat tccttgtgca caacaattag caagagtccc 6720 

agtgcacagc ccgttacggt ggtcaaacgt ggcagttccg aggatcagag catagagaag 678 0 

cttttcagcc acttttccga cgagatgctg gtgaatgtgg agtttgactc taacgatgag 6840 

cttgtaggaa ttacaccaag ggcaacactc gtttcccgaa acacagaaga tcgggactat 6900 

ctggataaac tagagtcttt ggagcgcgat gaggagactt tccaaccggt tgttggggaa 6960 

aaattcatac aggaaaatgt ccaggacgaa gtggacggat tgcatttccc atcacgccca 702 0 

caacggagac ccaaaagcag ttcatcttcc agtgaaccat cacttcct^gt ggctcctcaa 7 080 

aggttggaaa aaaagctatc gaaattagat ccggaggata tgcctccttc tgtacaggat 714 0 

ctattacaac aggtttacca gaagaacatc caacctgaat tagtggaagt aattccagtt 7200 
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gaaggcaaac aaactttaag gttccccagc atgttggcgg aagaagatgt agacgaagtg 7260 

gatcactcta aagaaggaat caagaaaatt gaaacggctc cagaagaggt tcgtaaggta 732 0 

accgagccgg aggatgttgc tcgggtaatt ccaagtccaa tcaaaccgtc aataagtcag 7380 

agcaactccc tcaagagcga aaattcctct ggcagcagtt tagtggaaat accaaaaata 7440 

attgcaccac ccaagtcgag tagcaaggaa aactcttcgg attgggacag ggagaagctg 75 00 

"cccgcatcac ccatgccacg acgaagactt ctgccaaatc aaacgccata taaagcccca 7560 

agcgtggcta gtaaggagag ctccttagag tgggacatgg agaagctacc caacagtccc 762 0 

atgttgccaa ggaggaacaa gatgcgagca atctcaccta gtaccaaccc tgtgcaactt 7680 

ctcaacaatt taccctccga tgtagatgac gaggcggcac agagacgcct tatcgaggat 774 0 

ttcgagcagg agagacgtca ggctctgatc aagcgggatg agaacttcga ggccattgcg 7800 

gcggagcaac gcaggcgcga ctccttacag agcagcagta actcgagcag caaacgcagt 7860 

ttaccaccgc ccacgccgcc catgatggca tcgcgacgcg gtacaactca agacacaaat 792 0 

cgcacccagg ataccgcatc gcggcacgag ggaacgccgc ccatgttcaa gaagctggat 798 0 

gtcgatggca gcggtacatc aatggattcc acctcctgct ccacgcggcg cagttcgttt 8 04 0 

gcatttatag agttacagga taacaaacca gtgattgtgc ccatgcccaa gaaactgaag 8100 

ttgccaaagc cggagccacc taggttcgta cccgagccag tggccactga tgagcctgtg 8160 

cccgaggttt ttcagggtcg cgcttggccc aagacacagt tggaaggaga ggtcgatcta 8220 

ggcgattcgg ataatgaaga tgaaacagaa aagctaaaga aacagctgcc cgaatacgct 828 0 

cgttcggact ctcctccttc ggctgcattc aagaatcgca agtggccgga tgggaaaaca 8340 

gtttttgata aacgagccga atctcttgag gaggaagata tcttcgaagg attaccgtcc 8400 

ccaaggaaaa gaggttctca aagattcatg gacaagccgc gctctcaatc gccacagcct 8460 

ttcaaaccgc tggccaacag ctccaggaaa agctcaaagt cttttagcga ccttaagaaa 852 0 

ggaccctcct tgcaatctct gtcggcgcaa tccagccagg acacggacac actgtccacc 8580 

accacaacag tagccacagc tcgtcctgct agctatgcca actacgagga ccccatggat 8 64 0 

gccagtaccc aagctttgct ggatcgaagc aaacggttac acaaccgcaa aagagatttc 8700 

gtaaacgagc gagtagtgga gcgcaacccc tatatgaggg atgtgcttag gagcacggat 8760 

cgccgtgatt acgacgacgt ggatgaagat ctgactagct acaggccaag acattatgcc 882 0 

agctccacgc taaatcgttt ccccaacacc acaataagaa aaagcaacaa ctacgattac 8880 

ctcagtccca gcagtgatta tttgagcagg agaagctaca taccgagcgc cagtgcaacc 894 0 

agcagttact atccatcgac cacgcgtagc tcccatttga gtgacctgtt ccgacgacgc 9000 
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agccccgcca 
atctcaatcg 
gtacgatcta 
aattctgcca 
gcgcccaaaa 
tccaccatga 
cgtgagaagg 
agaccgttgg 
aatccacggc 
aaaaagagcc 
atccagaagt 
atatcggagt 
ttcctcgatc 
ctaatccgtt 
aaaactggct 
aacgagctga 
ctggccagaa 
ctggttacac 
aagcagttgg 
aaaatcagcg 
gtgaaagaat 
ccaggtggag 
ccggaaaagg 
cccaatatcc 
tccatctgct 
tccacaacta 
gtgcacgaca 
tatattgggg 
ccagcaccgg 
gaaccagtaa 
gttactgctc 



gcggaactgt 
ggctggcctt 
cgaaggttca 
ctgagatatc 
tcttcatcga 
tcggacccaa 
gcggcagcta 
tccaggtgga 
caggagacta 

tggggctaaa 

ccggatccac 
gccagaagct 
gcacaaagtt 
ccgccaccag 
ccagccactc 
acaaggggat 
aaagttcacc 
ctgagaagcc 
tcgatggtag 
aactgcagca 
gcataccaga 
aggaccaaca 
atgtggccga 
catgggaaaa 
caagctcaga 
gtccagatac 
caagtggtgc 
atcccgagga 
cgcttaaaac 
ctgctactcc 
cccctccttt 



atccgcactt 
agatcgagtt 
aaccgagtcc 
caccgactct 
tgacacccat 
tgcagcttcc 
cctccagaag 
tcccaccctg 
cttgctaaac 
aaagcgctat 
ttcagtgctg 
tttgaatccc 
gggcgaaggc 
caatgtgatt 
cacggacaac 
ggaatacacg 
caccacgccg 
aattgacatt 
cgccatggat 
ggaagtgaag 
tatactggga 
gagcctgctg 
acatgagctt 
aagcaagccg 
ctcttctagt 
tcagacagtt 
cctgatgcag 
tgtcaagttc 
gaatgagctg 
ggagccggtg 
gccagagatg 



tccggctacg 
ggccacttga 
gagagcactt 
gagtttgaca 
ctaaggaagc 
gccggactcc 
taccaaccac 
ctcattggca 
aagacggcca 
ctgctgggtg 
gattcacgca 
agcagcgaca 
agccagacga 
aacgatctgc 
gagaaggaaa 
gatgcggtca 
acgaataaga 
atcgatctaa 
gtagatgaac 
gaggaaccca 
cacattaagg 
gagcagtcgg 
tatgaaccgg 
gaggtcatgt 
attgaagaca 
ggcggaaagc 
gtggacagcc 
ttggatatgc 
gatatggagg 
gaatgtacgg 
ggtccaccca 



gcaacaaaga 
ttgaaagtaa 
cacccgacga 
acgatgagat 
ccaccaaggt 
atcagaagca 
aaccgccact 
gccagcgcgc 
gtacggaggg 
agccggccaa 
ttcgcagctt 
taagtgccgg 
cacccggaca 
gcgtggagct 
acgttttcgt 
atgccacgct 
cggtggtcga 
cggcactgga 
gcctcacacc 
agccggatgt 
agggaacggg 
acgaagagaa 
acagtgtgca 
ctaccaccgg 
tccagcacta 
acaatgtgcc 
tgatgattgt 
cggccaatgt 
atgaccaaga 
tcatcgaggc 
aactgaagtt 



gtcgtgcgtt 
atgcacctgg 
agtggagctc 
tatacgccag 
tcagatcaag 
gttggcggcg 
gccacagttt 
tcctcttcag 
tatcgcctca 
tggcaataag 
ccagtcgaac 
catgcgaacc 
gacgaacgaa 
tcggatacag 
gaactgtaag 
gctggaccag 
ggttattgac 
aacgccgaaa 
cgatagcaac 
ctctagggat 
atcgaaggag 
gcgcgactca 
gatccaggtg 
cagcagtggc 
cattttggag 
ccgtttggag 
gaacggaaag 
tattgttccg 
ggcggaggcg 
tgagcgccgt 
cgatagcaaa 



9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
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aatgagaaca 


agatcgagag 


cttgaagaat 


cttccgttga 


tcgtagagag 


caatgtggag 


10920 


cacagtcagg cagtgaaacc 


cattactctt 


aacttaagca 


atctggccag gacgccggat 


10980 


acaccaacca 


cgcccacggc 


gcacgatagc 


gataaaacac 


ccactgggga 


aattctctcg 


11040 


cgaggatctg 


actcagaaac 


cgagcacact 


ggcactggtc 


aggtactaac 


ggagacggaa 


11100 


ctctccgact 


ggacggccga 


cgactgtatc 


tcggagaact 


ttgttgactt 


ggagttcgcg 


11160 


cttaactcta 


acaagggtac 


gataaaacgg 


cgcaaggatc 


gacgacgcag 


tggagcaagc 


11220 


aaacttccca 


gtggcaacga 


ggtaatccac 


gagctggcca 


ggcaggcgcc 


agtggtgcaa 


11280 


atggatggaa 


ttcttagtgc 


catcgacatt 


gatgacatag 


agttcatgga cacgggttcg 


11340 


gagggttctt 


gtgctgaagc 


ttatcccgca 


acaaatacag 


ctctcattca 


gaatagaggt 


11400 


tacatggagt 


acatcgaggc 


ggagccgaaa 


aagacgaccc 


gcaaggcagc 


tccaccatcc 


11460 


agttacccag 


gaaatttacc 


gcctttaatg 


acgaagcggg 


acgagaaact 


gggcgttgat 


11520 


tacattgagc 


agggggcgta 


cataatgcac 


gatgatgcaa 


agacgcctgt 


gaatgaggtg 


11580 


gctcctgcca 


tgacccagtc 


gctaactgac 


tcaatcacgc 


tcaatgaact 


ggatgatgac 


11640 


agtatgataa 


tatcccaaac 


ccagccaacg 


acaacggagg 


aaagtgaggc 


actgacggtg 


11700 


gtcaccagtc 


cacttgacac 


gtcctcgccc 


agggttctcg 


atcaatttgc 


atccatgttg 


11760 


gcggcgggaa 


aaggtgactc 


cacacccagt 


agctcagagc 


aacaaccaaa 


gacgtctacg 


11820 


gtgacgagca 


gcagcactgg 


gcccaactcc 


tcgacaacag gaaacgtctc gaaggagccg 


11880 


caggaggagg 


acctgcaaat 


ccagtttgag 


tatgttcgag 


cactgcagca gcggatatcg 


11940 


cagatcagca 


cccaacggcg 


taagagctct 


aagggagagg 


cacctaacct 


gcagctaaac 


12000 


agtagcgcac 


ctgtgataga 


atcagccgag 


gatccggcca 


agcccgcaga 


ggagcctctg 


12060 


gtctcaatgc 


gaccgcggac 


caccagcatt 


tccggaaagg 


taccggagat 


acccacactt 


12120 


agcagcaagc 


tggaagagat 


aaccaaagaa 


cgcactaagc 


aaaaggatct 


gattcacgac 


12180 


ctagtcatgg 


acaagttgca 


gtcgaagaag 


cagctaaacg 


ctgagaagcg 


tctgcaccgg 


12240 


agtcgacagc 


gcagtttgct 


gaccagtggc 


tatgccagtg 


gctccagcct 


tagtccgacg 


12300 


cccaagctgg 


ctgctgcttg 


cagtccgcag 


gattccaact 


gctctagcca 


agcgcactac 


12360 


cacgcctcca 


cggcggagga 


ggccccgaag 


ccgccggcgg 


aaaggccgtt 


gcagaagtcc 


12420 


gccacgtcca 


cctatgtgtc 


gccttatcgc 


actgtccaag 


cgcccacacg 


tagtgctgat 


12480 


ctctataagc 


cgcgcccctt 


cagcgaacac 


atcgattcga 


acgctctggc 


gggttacaag 


12540 


ctcggcaaga 


cggcctcgtt 


taatggcggc 


aagttgggcg 


actttgcgaa 


acccattgcc 


12600 


ccggcgagag 


ttaaccgagg 


aggaggtgtc 


gcgaccgcgg 


atatagccaa 


tatttccgcg 


12660 
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tcgacggaga acctaagaag cgaggccagg gccagggctc gtcttaagtc taacacagag 1272 0 

ctgggcctta gtcccgagga aaagatgcag ctaatacgtt caagattgca ctacgaccaa 1278 0 

aacagatctc tgaagccgaa gcaactggag gagatgccat ccggggatct ggcggcacgt 1284 0 

gcccgcaaaa tgagtgcctc gaagagcgtc aatgatctgg cctacatggt gggacagcag 12 900 

cagcagcagc aggttgagaa ggatgccgtg ctccaagcca aggcggctga ctttacatcc 12 960 

gatcccaatt tggcgtccgg tggtcaggag aaggcaggca aaactaagtc cggacgcagg 13 02 0 

ccaaaggatc cggagcggcg taagagtctc atacagtcgt tgtccagctt cttccaaaag 130 8 0 

ggatctggat ccgcggcctc cagttccaag gagcagggcg gcgctgtggc tgccgtccac 1314 0 

tctgaacagt cagagcgacc aggcaccagc agcagcggca cgcccacaat atcggatgcg 132 00 

gcgggtggag gcggaggagg aggtggcgtc ttcagcagat tccgcatctc gcccaagtcc 13260 

aaggagaagt caaagtcttg ctttgatctg aggaatttcg gttttggtga caaggatatg 13 32 0 

ctggtctgca atgcagcatc tccagcagga gccacatccg catcacagaa aaatcactcg 13380 

caagagtatc tgaacaccac gaacaacagt cgctatcgaa agcaaacgaa cactgcgaaa. 13440 

ccgaaacccg aatcgttctc ttcatccagt ccgcagctct atatacacaa gccccaccac 13500 

ctggccgcag ctcatcccag tgccctggac gaccagacac caccacccat accgcctctt 13560 

ccactgaatt atcagagatc cgatgatgag agctacgcta acgagacacg agagcataag 13 62 0 

aagcaacgtg ccatatcgaa ggcttcacga caagctgagc tcaagcgatt gcgaatcgct 13680 

caagagattc agcgggaaca ggaggagatc gaggtgcaac tgaaggatct ggaggcacgc 1374 0 

ggcgtgctta ttgagaaggc cttgcgaggc gaggcgcaga atattgaaaa cctggatgcg 13800 

acaaaggaca acgacgagaa gctacttaag gaacttttgg agatttggcg caacatcaca 13860 

gcactcaaga aacgcgatga ggaactgact ataaggcaac aggaactgca actggagtat 13 92 0 

cggcatgccc agctgaagga agagctcaat ctgcgcttgt cctgcaacaa actggacaaa 13 980 

agctctgccg atgtggccgc cgagggagca attctcaacg agatgctgga aattgtcgcc 14 04 0 

aagcgagccg ccctacgacc cacagcctcc cagctcgacc tcacggcagc gggatcagca 14100 

tccacgtccg ccgaggcaac gggcattaag ctgacgggac aaccgcatga ccacgaagaa 1416 0 

tcgatcattt ga 14172 

<210> 8 
<211> 4723 
<212> PRT 
<213> Drosophila 

<400> 8 

Met Ser Arg Gin His Gin Arg His His Gin Gin His His His Leu Pro 
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10 15 



Pro His Gin Gin Pro Gin Gin Gin Met Pro Gin Gin Gin Gin Gin Leu 
20 25 30 



Thr Ala Gin Gin Gin Gin Gin Gin Gin Leu Leu Met Ala Glu His Ala 
35 40 45 



Ala Ala Ala Glu Ala Ala Glu Leu Phe Asp Leu Leu Cys Val Ala Thr 
50 55 60 



Thr Met Arg Gin lie Leu Ala Leu His Arg Ala Met Cys Glu Ala Val 
65 70 75 80 



Gly Leu Arg Pro Ser Pro Leu Asn Asp Phe Tyr Pro Arg Leu Lys Ala 
85 90 95 



Lys Val Arg Ser Trp Lys Ala Gin Ala Leu Trp Lys Lys Phe Asp Ala 
100 105 110 



Arg Ala Ala His Arg Val Tyr Gly Lys Gly Ala Ala Cys Thr Gly Thr 
115 120 125 



Arg Val Leu Val He Gly Ala Gly Pro Cys Gly Leu Arg Thr Ala He 
130 135 140 



Glu Ala Gin Leu Leu Gly Ala Lys Val Val Val Leu Glu Lys Arg Asp 
145 150 155 160 



Arg He Thr Arg Asn Asn Val Leu His Leu Trp Pro Phe Val He Thr 
165 170 175 



Asp Leu Arg Asn Leu Gly Ala Lys Lys Phe Tyr Gly Lys Phe Cys Ala 
180 185 190 



Gly Ser He Asp His He Ser He Arg Gin Leu Gin Cys Met Leu Leu 
195 200 205 



Lys Val Ala Leu Leu Leu Gly Val Glu He His Glu Gly Val Ser Phe 
210 215 220 



Asp His Ala Val Glu Pro Ser Gly Asp Gly Gly Gly Trp Arg Ala Ala 
225 230 235 240 



Val Thr Pro Ala Asp His Pro Val Ser His Tyr Glu Phe Asp Val Leu 
245 250 255 
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He Gly Ala Asp Gly Lys Arg Asn Met Leu Asp Phe Arg Arg Lys Glu 
260 265 270 



Phe Arg Gly Lys Leu Ala He Ala He Thr Ala Asn Phe He Asn Lys 
275 280 285 



Lys Thr Glu Ala Glu Ala Lys Val Glu Glu He Ser Gly Val Ala Phe 
290 295 • 300 



He Phe Asn Gin Ala Phe Phe Lys Glu Leu Tyr Gly Lys Thr Gly He 
305 310 315 ~ ' 320 



Asp Leu Glu Asn He Val Tyr Tyr Lys Asp Glu Thr His Tyr Phe Val 
325 330 335 



Met Thr Ala Lys Lys His Ser Leu He Asp Lys Gly Val He He Glu 
340 345 350 



Asp Met Ala Asp Pro Gly Glu Leu Leu Ala Pro Ala Asn Val Asp Thr 
355 360 365 



Gin Lys Leu His Asp Tyr Ala Arg Glu Ala Ala Glu Phe Ser Thr Gin 
370 375 380 



Tyr Gin Met Pro Asn Leu Glu Phe Ala Val Asn His Tyr Gly Lys Pro 
385 390 395 400 



Asp Val Ala Met Phe Asp Phe Thr Ser Met Phe Ala Ala Glu Met Ser 
405 410 415 



Cys Arg Val He Val Arg Lys Gly Ala Arg Leu Met Gin Cys Leu Val 
420 425 430 



Gly Asp Ser Leu Leu Glu Pro Phe Trp Pro Thr Gly Ser Gly Cys Ala 
435 440 445 



Arg Gly Phe Leu Ser Ser Met Asp Ala Ala Tyr Ala He Lys Leu Trp 
450 455 460 



Ser Asn Pro Gin Asn Ser Thr Leu Gly Val Leu Ala Gin Arg Glu Ser 
465 470 475 " 480 



He Tyr Arg Leu Leu Asn Gin Thr Thr Pro Asp Thr Leu Gin Arg Asp 
485 490 495 
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lie Ser Ala Tyr Thr Val Asp Pro Ala Thr Arg Tyr Pro Asn Leu Asn 
500 505 510 



Arg Glu Ser Val Asn Ser Trp Gin Val Lys His Leu Val Asp Thr Asp 
515 520 525 



Asp Pro Ser lie Leu Glu Gin Thr Phe Met Asp Thr His Ala Leu Gin 
530 535 540 



Thr Pro His Leu Asp Thr Pro Gly Arg Arg Lys Arg Arg Ser Gly Asp 
545 550 555 " 560 



Leu Leu Pro Gin Gly Ala Thr Leu Leu Arg Trp lie Ser Ala Gin Leu 
565 570 575 



His Ser Tyr Gin Phe lie Pro Glu Leu Lys Glu Ala Ser Asp Val Phe 
580 585 590 



Arg Asn Gly Arg Val Leu Cys Ala Leu lie Asn Arg Tyr Arg Pro Asp 
595 600 605 



Leu lie Asp Tyr Ala Ala Thr Lys Asp Met Ser Pro Val Glu Cys Asn 
610 615 620 



Glu Leu Ser Phe Ala Val Leu Glu Arg Glu Leu His lie Asp Arg Val 
625 630 635 ~ 640 



Met Ser Ala Lys Gin Ser Leu Asp Leu Thr Glu Leu Glu Ser Arg lie 
645 650 655 



Trp Leu Asn Tyr Leu Asp Gin lie Cys Asp Leu Phe Arg Gly Glu lie' 
660 665 670 



Pro His lie Lys His Pro Lys Met Asp Phe Ser Asp Leu Arg Gin Lys 
675 68 0 685 



Tyr Arg lie Asn His Thr His Ala Gin Pro Asp Phe Ser Lys Leu Leu 
690 695 700 



Ala Thr Lys Pro Lys Ala Lys Ser Pro Met Gin Asp Ala Val Asp lie 
705 710 715 720 



Pro Thr Thr Val Gin Arg Arg Ser Val Leu Glu Glu Glu Arg Ala Lys 
725 730 735 
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Arg Gin Arg Arg His Glu Gin Leu Leu Asn lie Gly Gly Gly Ala Ala 
740 745 ~ 750 

Gly Ala Ala Ala Gly Val Ala Gly Ser Gly Thr Gly Thr Thr Thr Gin 
755 760 765 

Gly Gin Asn Asp Thr Pro Arg Arg Ser Lys Lys Arg Arg Gin Val Asp 
770 775 780 

Lys Thr Ala Asn lie Glu Glu Arg Gin Gin Arg Leu Gin Glu lie Glu 
785 790 795 800 

Glu Asn Arg Gin Glu Arg Met Ser Lys Arg Arg Gin Gin Arg Cys His 
805 810 815 

Gin Thr Gin Asn Phe Tyr Lys Ser Leu Gin Leu Leu Gin Ala Gly Lys 
820 825 830 

Leu Leu Arg Glu Gly Gly Glu Ala Gly Val Ala Glu Asp Gly Thr Pro 
835 840 845 

Phe Glu Asp Tyr Ser He Phe Leu Tyr Arg Gin Gin Ala Pro Val Phe 
850 855 860 

Asn Asp Arg Val Lys Asp Leu Glu Arg Lys Leu Leu Phe Pro Asp Arg 
865 870 875 880 

Glu Arg Gly Asp He Pro Ser Ala Leu Pro Arg Thr Ala Asp Glu Gin 
885 890 895 

Phe Ser Asp Arg lie Lys Asn Met Glu Gin Arg Met Thr Gly Arg Gly 
900 905 910 

Gly Leu Gly Gly Asp Lys Lys Pro Lys Asp Leu Met Arg Ala He Gly 
915 920 925 

Lys He Asp Ser Asn Asp Trp Asn Val Arg Glu lie Glu Lys Lys He 
93 0 935 940 

Glu Leu Ser Lys Lys Thr Glu He His Gly Pro Lys Gly Arg Glu Lys 
945 950 955 J ~ 960 

Val Pro Lys Trp Ser Lys Glu Gin Phe Gin Ala Arg Gin His Lys Met 
965 970 975 



Ser Lys Pro Gin Arg 



Gin Asp Ser Arg Glu Ala Glu Lys Phe Lys Asp 
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980 985 990 

lie Asp Gin Thr lie Arg Asn Leu Asp Lys Gin Leu Lys Glu Gly His 
995 1000 1005 

Asn Leu Asp Val Gly Glu Arg Gly Arg Asn Lys Val Ala Ser lie 
1010 1015 1020 

Ala Gly Gin Phe Gly Lys Lys Asp Glu Ala Asn Ser Asp Glu Lys 
1025 1030 1035 

Asn Ala Gly Ser Ser Asn Ala Thr Thr Asn Thr Asn Asn Thr Val 
1040 1045 1050 

lie Pro Lys Ser Ser Ser Lys Val Ala Leu Ala Phe Lys Lys Gin 
1055 1060 1065 

Ala Ala Ser Glu Lys Cys Arg Phe Cys Lys Gin Thr Val Tyr Pro 
1070 1075 1080 

Met Glu Lys Thr Thr Val Glu Gly Leu Val Leu His Arg Asn Cys 
1085 1090 1095 

Leu Lys Cys His His Cys His Thr Asn Leu Arg Leu Gly Gly Tyr 
1100 1105 1110 

Ala Phe Asp Arg Asp Asp Pro Gin Gly Arg Leu Tyr Cys Thr Gin 
1115 1120 1125 

His Phe Arg Leu Pro Pro Lys Pro Leu Pro Gin Arg Thr Asn Lys 
1130 1135 1140 

Ala Arg Lys Ser Ala Ala Ala Gin Pro Ala Ser Pro Ala Val Pro 
1145 1150 1155 

Pro Thr Ala Gly Ser Val Pro Thr Ala Ala Ala Thr Ser Glu His 
1160 1165 1170 

Met Asp Thr Thr Pro Pro Arg Asp Gin Val Asp Leu Leu Gin Thr 
1175 1180 1185 

Ser Arg Ala Asn Ala Ser Ala Asp Ala Met Ser Asp Asp Glu Ala 
1190 1195 1200 



Asn Val lie Asp Glu His Glu Trp Ser Gly Arg Asn Phe Leu Pro 
1205 1210 1215 
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Glu Ser Asn Asn Asp Ser Gin Ser Glu Leu Ser Ser Ser Asp Glu 
1220 1225 1230 



Ser Asp Thr Glu Ser Asp Ser Glu Met Phe Glu Glu Ala Asp Asp 
1235 1240 1245 



Ser Pro Phe Gly Ala Gin Thr Leu Gin Leu Ala Ser Asp Trp lie 
1250 1255 1260 



Gly Lys Gin Tyr Cys Glu Asp Ser Asp Asp Ser Asp Asp Phe Tyr 
1265 1270 1275 



Asp Ser Ser Glu Asp Asp Gly Lys Asp Asp Thr Glu Gly Glu Glu 
1280 1285 1290 



Phe Lys Lys Ala Arg Glu Leu Arg Arg Gin Glu Val Arg Leu Gin 
1295 1300 1305 



Pro Leu Pro Ala Asn Leu Pro Thr Asp Thr Glu Thr Glu Lys Leu 
1310 1315 1320 



Lys Leu Asn Val Asp Asn Lys Glu Asn Met Ala Asp Gly Ser Ser 
1325 1330 1335 



Leu Lys Ser Gly Asn Ser Phe Glu Ser Ala Arg Ser Gin Pro Ser 
1340 1345 1350 



Thr Pro Leu Ser Thr Pro Thr Arg Val Glu Met Glu Gin Leu Glu 
1355 1360 1365 



Arg Asp Ala Pro Arg Lys Phe Ser Ser Glu lie Glu Ala lie Ser 
1370 1375 1380 



Glu Lys Leu Tyr His Met Asn Asn Met Val Lys Met Asn Lys Asp 
1385 1390 1395 



Leu Glu Val Leu Ala Lys Glu Asn Leu Val Lys Ser Gly lie Leu 
1400 1405 1410 



Arg Lys Leu Thr Leu Lys Glu Lys Trp Leu Ala Glu Asn Ala Ala 
1415 1420 1425 



lie Ala Ala Gly Gin Lys Val Thr Pro Thr Pro Ser Ala Thr Ala 
1430 1435 1440 
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Pro Gly Leu Gin Pro Lys Ser Lys Phe Asp Glu Lys Phe Glu Lys 
1445 1450 1455 



Val Val Ser Pro Pro Gin Pro Val Val Glu Pro Lys Pro Lys Pro 
1460 1465 1470 



Val lie Asp Phe Asn Leu Asp Glu Leu Lys Pro Arg Lys Pro Asn 
1475 1480 1485 



Phe Glu Glu Arg Pro Lys Glu Gin Leu Pro Arg Pro Glu Ser Leu 
1490 1495 1500 



Lys Lys Pro Pro Gin Gin Lys Pro Lys Gly Ser Ser Thr Asn Val 
1505 1510 1515 



Ser Arg Ser Asn Ser Leu Lys Ser Asn Ala Ser Asn Gly Ser Pro 
1520 1525 1530 



Lys Val Lys Lys Ala Pro Val Ser Asn Asn Ser Lys Met Gin lie 
1535 1540 1545 



Glu Gly He Leu Asp Thr Leu Arg Lys He Gin Ser Gin Asn Ser 
1550 1555 1560 



Ser Asp Gin Asp Glu Asp Met Asp Val Asp Glu Asp Val Glu Arg 
1565 1570 1575 



Lys Pro Asn Lys Glu Leu Asn Ser Lys Leu Lys Glu He Gin Ala 
1580 1585 1590 



Ser Ser Phe Ala Gly Thr Met Asp His He Lys Ser Gin Leu Thr 
1595 1600 "* 1605 



Met Pro Thr Val Ser Ala Gin Ala Pro Pro Ser Met Asp Leu Ser 
1610 1615 1620 



Lys Tyr Phe Pro Asn Gin Lys Gin Glu Lys Ser Ser Thr Ser Ser 
1625 1630 1635 



Thr Asn Lys Asn Gin Val Thr Leu Lys Asp Val Asn Leu Ala Lys 
1640 1645 ~ 1650 



Tyr Phe Pro Ser Ser Pro Ala Pro Gin Arg Arg Thr Val Glu Thr 
1655 1660 1665 
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Val Ala Asp Arg Leu Lys Lys Ser Gin Thr Glu Ala Ala Leu Ala 
1670 ^ 1675 1680 



Lys Thr Lys Leu Leu Glu Asp Gin Ala Asn Asn Gin Ala Glu Lys 
1685 1690 1695 



Thr Lys Lys Glu Val Glu Lys Glu Gly Glu Ser Lys Lys lie Thr 
1700 1705 1710 



Lys Lys Val Ala Asp Ser Lys Ala Val Pro Pro Lys Arg Gin Ala 
1715 1720 1725 



Ser Leu Asp Thr Phe Ser Leu Arg Glu His Gin Met Asp Gly Ala 
1730 1735 1740 



Leu Asp Leu Thr Lys Lys Lys Gly Pro Thr Lys Ala Ser Ala Gly 
1745 1750 1755 



Val Lys Lys Pro Ala Lys Ser Gly Ser Thr Thr Ser Val Thr Lys 
1760 1765 1770 



Ala Thr Ala Thr Ser Glu Gly Lys Thr lie Lys lie Val Lys Lys 
1775 1780 1785 



lie Val Pro Lys Gly Thr Lys Ala Lys Lys Ala Ala Glu Ala Ala 
1790 1795 1800 



Gin Glu Ser Ala Val Val Glu Ala Pro Pro Glu Lys Lys Pro Pro 
1805 1810 1815 



Lys Asp Glu Ala Glu Arg lie Leu Asp Glu lie Leu Gly Asp Gly 
1820 1825 1830 



Glu Tyr Arg Ser Pro Ser Ser Glu Tyr Gin Arg Leu Phe Gin Asp 
1835 1840 1845 



Glu Lys Ser Pro Ser Asp Leu Ser Asp Asn lie Asp Arg lie Leu 
1850 1855 1860 



Glu Glu Ser Glu Leu Asp Val Glu Leu Gly Leu Pro Lys Arg Ser 
1865 1870 1875 



Ser Lys Lys Leu Val Lys Thr Lys Ser Leu Gly Glu Gly Asp Phe 
1880 1885 1890 



Asp Met Lys Pro Ser Lys Glu Arg Leu Thr Gly Val Gin Asn lie 
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1895 1900 1905 

Leu Lys Arg Phe Glu Ser Met Ser Ser Val Thr Ser Gin Asn Ser 
1910 1915 1920 

Asp Glu Gin Ala Gly Phe Lys Leu Arg Arg Met Glu Ser Thr Thr 
1925 1930 1935 

Ser Asn Leu Ser Ser Leu Thr Arg Ser Arg Glu Ser Leu Val Ser 
1940 1945 ~ 1950 

Val Ser Asp Ser Met Ser Asp Leu Glu Lys Thr Met Asp Tyr Leu 
1955 I960 1965 

Arg Asn Glu Trp Arg Asn Glu Ala Thr Asn Phe Leu Gin Lys Lys 
1970 1975 1980 

Arg Asp Lys Phe Tyr Ala Lys Lys Glu Glu Gin Glu Lys Glu Ala 
1985 1990 1995 

Lys lie Leu Ala Lys Pro Asp Pro Leu Asp Asn Leu Pro Val Gin 
2000 2005 2010 

Tyr Arg Asp Ser Lys Leu Ala Lys Phe Phe Gly Leu Ala Ala Ser 
2015 2020 * 2025 

Lys Ser Pro Glu Asn Arg Lys Ser Pro lie Lys Lys Lys Lys Ser 
2030 2035 2040 

Pro Ser Lys Thr Pro Lys Val Thr Lys Ala Asn Asn Ser Leu Glu 
2045 2050 2055 

Glu Leu Ala Lys lie Ser Asn Val Arg Gin Thr Lys Lys Ala Gin 
2060 2065 2070 

Pro Lys Thr Leu Lys Pro Val Glu Val Lys Pro Leu Lys Pro Ala 
2075 2080 2085 

Ser Pro Val Pro Asp Asp Phe Glu lie Leu Asp Leu Leu Glu Lys 
2090 2095 2100 

Ala Thr Glu Ala Lys Glu Leu Glu Arg Ser Lys Thr Lys Ser Pro 
2105 2110 2115 



Ala Val Glu Ser lie Ser Gin Thr Pro Lys Glu Ala lie Val Glu 
2120 2125 ' 2130 
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He Ser Leu Pro Val Glu Asp He Lys Asn Leu Pro Lys Thr Gly 
2135 2140 2145 



Cys Asp Lys Ser Ser Asn Ser Ser Arg Arg Gly Ser Gin Ser Ser 
2150 2155 2160 



Leu He Met Ser Arg Arg His Ser Glu He Ser Leu Asn Glu Lys 
2165 2170 2175 



Leu Asn Gin Asp Ala Leu Ala Ala Leu Asn Gin He Glu Lys Glu 
2180 2185 2190 



Arg Glu Ala Glu Gin Val Asp Glu Leu Phe Gin Ser Met Val Glu 
2195 2200 2205 



Glu Met Glu Gin Glu Pro Gin Pro Thr Ala He Val Glu Pro Pro 
2210 2215 2220 



Glu Glu Asp He Asp Ala Asp Ser Leu Cys Thr Thr He Ser Lys 
2225 2230 2235 



Ser Pro Ser Ala Gin Pro Val Thr Val Val Lys Arg Gly Ser Ser 
2240 2245 2250 



Glu Asp Gin Ser He Glu Lys Leu Phe Ser His Phe Ser Asp Glu 
2255 2260 2265 



Met Leu Val Asn Val Glu Phe Asp Ser Asn Asp Glu Leu Val Gly 
2270 2275 ~ 2280 



He Thr Pro Arg Ala Thr Leu Val Ser Arg Asn Thr Glu Asp Arg 
2285 2290 2295 



Asp Tyr Leu Asp Lys Leu Glu Ser Leu Glu Arg Asp Glu Glu Thr 
2300 2305 2310 



Phe Gin Pro Val Val Gly Glu Lys Phe He Gin Glu Asn Val Gin 
2315 2320 2325 



Asp Glu Val Asp Gly Leu His Phe Pro Ser Arg Pro Gin Arg Arg 
2330 2335 2340 



Pro Lys Ser Ser Ser Ser Ser Ser Glu Pro Ser Leu Pro Val Ala 
2345 2350 2355 
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Pro Gin Arg Leu Glu Lys Lys Leu Ser Lys Leu Asp Pro Glu Asp 
2360 2365 2370 



Met Pro Pro Ser Val Gin Asp Leu Leu Gin Gin Val Tyr Gin Lys 
2375 2380 2385 



Asn lie Gin Pro Glu Leu Val Glu Val lie Pro Val Glu Gly Lys 
2390 2395 2400 



Gin Thr Leu Arg Phe Pro Ser Met Leu Ala Glu Glu Asp Val Asp 
2405 2410 2415 



Glu Val Asp His Ser Lys Glu Gly He Lys Lys He Glu Thr Ala 
2420 2425 2430 



Pro Glu Glu Val Arg Lys Val Thr Glu Pro Glu Asp Val Ala Arg 
2435 2440 2445 



Val He Pro Ser Pro He Lys Pro Ser He Ser Gin Ser Asn Ser 
2450 2455 2460 



Leu. Lys Ser Glu Asn Ser Ser Gly Ser Ser Leu Val Glu He Pro 
2465 2470 2475 



Lys He He Ala Pro Pro Lys Ser Ser Ser Lys Glu Asn Ser Ser 
2480 2485 2490 



Asp Trp Asp Arg Glu Lys Leu Pro Ala Ser Pro Met Pro Arg Arg 
2495 2500 2505 



Arg Leu Leu Pro Asn Gin Thr Pro Tyr Lys Ala Pro Ser Val Ala 
2510 2515 2520 



Ser Lys Glu Ser Ser Leu Glu Trp Asp Met Glu Lys Leu Pro Asn 
2525 2530 2535 



Ser Pro Met Leu Pro Arg Arg Asn Lys Met Arg Ala He Ser Pro 
254 0 2545 2550 



Ser Thr Asn Pro Val Gin Leu Leu Asn Asn Leu Pro Ser Asp Val 
2555 2560 2565 



Asp Asp Glu Ala Ala Gin Arg Arg Leu He Glu Asp Phe Glu Gin 
2570 2575 2580 
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Glu Arg Arg Gin Ala Leu lie Lys Arg Asp Glu Asn Phe Glu Ala 
2585 2590 2595 



lie Ala Ala Glu Gin Arg Arg Arg Asp Ser Leu Gin Ser Ser Ser 
2600 2605 2610 



Asn Ser Ser Ser Lys Arg Ser Leu Pro Pro Pro Thr Pro Pro Met 
2615 . 2620 2625 



Met Ala Ser Arg Arg Gly Thr Thr Gin Asp Thr Asn Arg Thr Gin 
2630 < 2635 2640 



Asp Thr Ala Ser Arg His Glu Gly Thr Pro Pro Met Phe Lys Lys 
2645 2650 2655 



Leu Asp Val Asp Gly Ser Gly Thr Ser Met Asp Ser Thr Ser Cys 
2660 2665 2670 



Ser Thr Arg Arg Ser Ser Phe Ala Phe lie Glu Leu Gin Asp Asn 
2675 2680 2685 



Lys Pro Val lie Val Pro Met Pro Lys Lys Leu Lys Leu Pro Lys 
2690 2695 2700 



Pro Glu Pro Pro Arg Phe Val Pro Glu Pro Val Ala Thr Asp Glu 
2705 2710 2715 



Pro Val Pro Glu Val Phe Gin Gly Arg Ala Trp Pro Lys Thr Gin 
2720 2725 2730 



Leu Glu Gly Glu Val Asp Leu Gly Asp Ser Asp Asn Glu Asp Glu 
2735 2740 2745 



Thr Glu Lys Leu Lys Lys Gin Leu Pro Glu Tyr Ala Arg Ser Asp 
2750 2755 ~ 2760 



Ser Pro Pro Ser Ala Ala Phe Lys Asn Arg Lys Trp Pro Asp Gly 
2765 2770 2775 



Lys Thr Val Phe Asp Lys Arg Ala Glu Ser Leu Glu Glu Glu Asp 
2780 2785 2790 



He Phe Glu Gly Leu Pro Ser Pro Arg Lys Arg Gly Ser Gin Arg 
2795 2800 2805 



Phe Met Asp Lys Pro Arg Ser Gin Ser Pro Gin Pro Phe Lys Pro 
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2810 2815 2820 



Leu Ala Asn Ser Ser Arg Lys Ser Ser Lys Ser Phe Ser Asp Leu 
2825 2830 2835 



Lys Lys Gly Pro Ser Leu Gin Ser Leu Ser Ala Gin Ser Ser Gin 
2840 2845 2850 



Asp Thr Asp Thr Leu Ser Thr Thr Thr Thr Val Ala Thr Ala Arg 
2855 2860 2865 



Pro Ala Ser Tyr Ala Asn Tyr Glu Asp Pro Met Asp Ala Ser Thr 
2870 2875 2880 



Gin Ala Leu Leu Asp Arg Ser Lys Arg Leu His Asn Arg Lys Arg 
2885 2890 2895 



Asp Phe Val Asn Glu Arg Val Val Glu Arg Asn Pro Tyr Met Arg 
2900 2905 2910 



Asp Val Leu Arg Ser Thr Asp Arg Arg Asp Tyr Asp Asp Val Asp 
2915 2920 2925 



Glu Asp Leu Thr Ser Tyr Arg Pro Arg His Tyr Ala Ser Ser Thr 
2930 2935 2940 



Leu Asn Arg Phe Pro Asn Thr Thr lie Arg Lys Ser Asn Asn Tyr 
2945 2950 2955 



Asp Tyr Leu Ser Pro Ser Ser Asp Tyr Leu Ser Arg Arg Ser Tyr 
2960 2965 2970 



lie Pro Ser Ala Ser Ala Thr Ser Ser Tyr Tyr Pro Ser Thr Thr 
2975 2980 2985 



Arg Ser Ser His Leu Ser Asp Leu Phe Arg Arg Arg Ser Pro Ala 
2990 2995 3000 



Ser Gly Thr Val Ser Ala Leu Ser Gly Tyr Gly Asn Lys Glu Ser 
3005 3010 3015 



Cys Val He Ser He Gly Leu Ala Leu Asp Arg Val Gly His Leu 
3020 3025 3030 



He Glu Ser Lys Cys Thr Trp Val Arg Ser Thr Lys Val Gin Thr 
3035 3040 3045 
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Glu Ser Glu Ser Thr Ser Pro Asp Glu Val Glu Leu Asn Ser Ala 
3050 3055 3060 



Thr Glu lie Ser Thr Asp Ser Glu Phe Asp Asn Asp Glu lie lie 
3065 3070 3075 



Arg Gin Ala Pro Lys lie Phe lie Asp Asp Thr His Leu Arg Lys 
3080 3085 3090 



Pro Thr Lys Val Gin lie Lys Ser Thr Met lie Gly Pro Asn Ala 
3095 3100 3105 



Ala Ser Ala Gly Leu His Gin Lys Gin Leu Ala Ala Arg Glu Lys 
3110 3115 3120 



Gly Gly Ser Tyr Leu Gin Lys Tyr Gin Pro Gin Pro Pro Leu Pro 
3125 3130 3135 



Gin Phe Arg Pro Leu Val Gin Val Asp Pro Thr Leu Leu lie Gly 
3140 3145 3150 



Ser Gin Arg Ala Pro Leu Gin Asn Pro Arg Pro Gly Asp Tyr Leu 
3155 3160 3165 



Leu Asn Lys Thr Ala Ser Thr Glu Gly lie Ala Ser Lys Lys Ser 
3170 3175 3180 



Leu Gly Leu Lys Lys Arg Tyr Leu Leu Gly Glu Pro Ala Asn Gly 
3185 3190 3195 



Asn Lys lie Gin Lys Ser Gly Ser Thr Ser Val Leu Asp Ser Arg 
3200 3205 3210 



lie Arg Ser Phe Gin Ser Asn lie Ser Glu Cys Gin Lys Leu Leu 
3215 3220 3225 



Asn Pro Ser Ser Asp lie Ser Ala Gly Met Arg Thr Phe Leu Asp 
3230 3235 3240 



Arg Thr Lys Leu Gly Glu Gly Ser Gin Thr Thr Pro Gly Gin Thr 
3245 3250 3255 



Asn Glu Leu lie Arg Ser Ala Thr Ser Asn Val lie Asn Asp Leu 
3260 3265 3270 
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Arg Val Glu Leu Arg lie Gin Lys Thr Gly Ser Ser His Ser Thr 
3275 3280 3285 



Asp Asn Glu Lys Glu Asn Val Phe Val Asn Cys Lys Asn Glu Leu 
3290 3295 3300 



Asn Lys Gly Met Glu Tyr Thr Asp Ala Val Asn Ala Thr Leu Leu 
3305 3310 3315 



Asp Gin Leu Ala Arg Lys Ser Ser Pro Thr Thr Pro Thr Asn Lys 
3320 3325 3330 



Thr Val Val Glu Val lie Asp Leu Val Thr Pro Glu Lys Pro He 
3335 3340 3345 



Asp He He Asp Leu Thr Ala Leu Glu Thr Pro Lys Lys Gin Leu 
3350 3355 3360 



Val Asp Gly Ser Ala Met Asp Val Asp Glu Arg Leu Thr Pro Asp 
3365 3370 3375 



Ser Asn Lys He Ser Glu Leu Gin Gin Glu Val Lys Glu Glu Pro 
3380 3385 3390 



Lys Pro Asp Val Ser Arg Asp Val Lys Glu Cys He Pro Asp He 
3395 3400 3405 



Leu Gly His He Lys Glu Gly Thr Gly Ser Lys Glu Pro Gly Gly 
3410 3415 ^ 3420 



Glu Asp Gin Gin Ser Leu Leu Glu Gin Ser Asp Glu Glu Lys Arg 
3425 3430 3435 



Asp Ser Pro Glu Lys Asp Val Ala Glu His Glu Leu Tyr Glu Pro 
3440 3445 3450 



Asp Ser Val Gin He Gin Val Pro Asn He Pro Trp Glu Lys Ser 
3455 3460 3465 



Lys Pro Glu Val Met Ser Thr Thr Gly Ser Ser Gly Ser He Cys 
3470 3475 3480 



Ser Ser Ser Asp Ser Ser Ser He Glu Asp He Gin His Tyr He 
3485 3490 3495 
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Leu Glu Ser Thr Thr Ser Pro Asp Thr Gin Thr Val Gly Gly Lys 
3500 3505 3510 

His Asn Val Pro Arg Leu Glu Val His Asp Thr Ser Gly Ala Leu 
3515 3520 3525 



Met Gin Val Asp Ser Leu Met lie Val Asn Gly Lys Tyr lie Gly 
3530 3535 3540 



Asp Pro Glu Asp Val Lys Phe Leu Asp Met Pro Ala Asn Val lie 
3545 3550 3555 



Val Pro Pro Ala Pro Ala Leu Lys Thr Asn Glu Leu Asp Met Glu 
3560 3565 3570 



Asp Asp Gin Glu Ala Glu Ala Glu Pro Val Thr Ala Thr Pro Glu 
3575 3580 3585 



Pro Val Glu Cys Thr Val He Glu Ala Glu Arg Arg Val Thr Ala 
3590 3595 3600 



Pro Pro Pro Leu Pro Glu Met Gly Pro Pro Lys Leu Lys Phe Asp 
3605 3610 3615 



Ser Lys Asn Glu Asn Lys He Glu Ser Leu Lys Asn Leu Pro Leu 
3620 3625 3630 



He Val Glu Ser Asn Val Glu His Ser Gin Ala Val Lys Pro He 
3635 3640 3645 



Thr Leu Asn Leu Ser Asn Leu Ala Arg Thr Pro Asp Thr Pro Thr 
3650 3655 3660 



Thr Pro Thr Ala His Asp Ser Asp Lys Thr Pro Thr Gly Glu He 
3665 3670 3675 



Leu Ser Arg Gly Ser Asp Ser Glu Thr Glu His Thr Gly Thr Gly 
3680 3685 3690 



Gin Val Leu Thr Glu Thr Glu Leu Ser Asp Trp Thr Ala Asp Asp 
3695 3700 3705 



Cys He Ser Glu Asn Phe Val Asp Leu Glu Phe Ala Leu Asn Ser 
3710 3715 3720 



Asn Lys Gly Thr He Lys Arg Arg Lys Asp Arg Arg Arg Ser Gly 
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3725 3730 3735 



Ala Ser Lys Leu Pro Ser Gly Asn Glu Val lie His Glu Leu Ala 
3740 3745 3750 



Arg Gin Ala Pro Val Val Gin Met Asp Gly lie Leu Ser Ala He 
3755 3760 3765 



Asp He Asp Asp He Glu Phe Met Asp Thr Gly Ser Glu Gly Ser 
3770 3775 3780 



Cys Ala Glu Ala Tyr Pro Ala Thr Asn Thr Ala Leu He Gin Asn 
3785 3790 3795 



Arg Gly Tyr Met Glu Tyr He Glu Ala Glu Pro Lys Lys Thr Thr 
3800 3805 3810 



Arg Lys Ala Ala Pro Pro Ser Ser Tyr Pro Gly Asn Leu Pro Pro 
3815 3820 3825 



Leu Met Thr Lys Arg Asp Glu Lys Leu Gly Val Asp Tyr He Glu 
3830 3835 3840 



Gin Gly Ala Tyr He Met His Asp Asp Ala Lys Thr Pro Val Asn 
3845 3850 ' 3855 



Glu Val Ala Pro Ala Met Thr Gin Ser Leu Thr Asp Ser He Thr 
3860 3865 3870 



Leu Asn Glu Leu Asp Asp Asp Ser Met He He Ser Gin Thr Gin 
3875 3880 3885 



Pro Thr Thr Thr Glu Glu Ser Glu Ala Leu Thr Val Val Thr Ser 
3890 3895 3900 



Pro Leu Asp Thr Ser Ser Pro Arg Val Leu Asp Gin Phe Ala Ser 
3905 3910 3915 



Met Leu Ala Ala Gly Lys Gly Asp Ser Thr Pro Ser Ser Ser Glu 
3920 3925 3930 



Gin Gin Pro Lys Thr Ser Thr Val Thr Ser Ser Ser Thr Gly Pro 
3935 3940 3945 



Asn Ser Ser Thr Thr Gly Asn Val Ser Lys Glu Pro Gin Glu Glu 
3950 3955 3960 
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Asp Leu Gin lie Gin Phe Glu Tyr Val Arg Ala Leu Gin Gin Arg 
3965 3970 3975 



lie Ser Gin lie Ser Thr Gin Arg Arg Lys Ser Ser Lys Gly Glu 
3980 3985 3990 



Ala Pro Asn Leu Gin Leu Asn Ser Ser Ala Pro Val lie Glu Ser 
3995 4000 4005 



Ala Glu Asp Pro Ala Lys Pro Ala Glu Glu Pro Leu Val Ser Met 
4010 4015 4020 



Arg Pro Arg Thr Thr Ser lie Ser Gly Lys Val Pro Glu He Pro 
4025 4030 4035 



Thr Leu Ser Ser Lys Leu Glu Glu He Thr Lys Glu Arg Thr Lys 
4040 4045 ~ 4050 



Gin Lys Asp Leu He His Asp Leu Val Met Asp Lys Leu Gin Ser 
4055 4060 4065 



Lys Lys Gin Leu Asn Ala Glu Lys Arg Leu His Arg Ser Arg Gin 
4070 4075 4080 



Arg Ser Leu Leu Thr Ser Gly Tyr Ala Ser Gly Ser Ser Leu Ser 
4085 4090 4095 



Pro Thr Pro Lys Leu Ala Ala Ala Cys Ser Pro Gin Asp Ser Asn 
4100 4105 4110 



Cys Ser Ser Gin Ala His Tyr His Ala Ser Thr Ala Glu Glu Ala 
4H5 4120 4125 



Pro Lys Pro Pro Ala Glu Arg Pro Leu Gin Lys Ser Ala Thr Ser 
4130 4135 4140 



Thr Tyr Val Ser Pro Tyr Arg Thr Val Gin Ala Pro Thr Arg Ser 
4145 4150 4155 



Ala Asp Leu Tyr Lys Pro Arg Pro Phe Ser Glu His He Asp Ser 
4160 4165 4170 



Asn Ala Leu Ala Gly Tyr Lys Leu Gly Lys Thr Ala Ser Phe Asn 
4175 4180 4185 
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Gly Gly Lys Leu Gly Asp Phe Ala Lys Pro lie Ala Pro Ala Arg 
4190 4195 4200 



Val Asn Arg Gly Gly Gly Val Ala Thr Ala Asp lie Ala Asn lie 
4205 4210 4215 



Ser Ala Ser Thr Glu Asn Leu Arg Ser Glu Ala Arg Ala Arg Ala 
4220 4225 4230 



Arg Leu Lys Ser Asn Thr Glu Leu Gly Leu Ser Pro Glu Glu Lys 
4235 4240 4245 



Met Gin Leu lie Arg Ser Arg Leu His Tyr Asp Gin Asn Arg Ser 
4250 4255 4260 



Leu Lys Pro Lys Gin Leu Glu Glu Met Pro Ser Gly Asp Leu Ala 
4265 4270 4275 



Ala Arg Ala Arg Lys Met Ser Ala Ser Lys Ser Val Asn Asp Leu 
4280 4285 4290 



Ala Tyr Met Val Gly Gin Gin Gin Gin Gin Gin Val Glu Lys Asp 
4295 4300 4305 



Ala Val Leu Gin Ala Lys Ala Ala Asp Phe Thr Ser Asp Pro Asn 
4310 4315 4320 



Leu Ala Ser Gly Gly Gin Glu Lys Ala Gly Lys Thr Lys Ser Gly 
4325 4330 ~ " 4335 



Arg Arg Pro Lys Asp Pro Glu Arg Arg Lys Ser Leu lie Gin Ser 
4340 4345 4350 



Leu Ser Ser Phe Phe Gin Lys Gly Ser Gly Ser Ala Ala Ser Ser 
4355 4360 4365 



Ser Lys Glu Gin Gly Gly Ala Val Ala Ala Val His Ser Glu Gin 
4370 4375 4380 



Ser Glu Arg Pro Gly Thr Ser Ser Ser Gly Thr Pro Thr lie Ser 
4385 4390 4395 



Asp Ala Ala Gly Gly Gly Gly Gly Gly Gly Gly Val Phe Ser Arg 
4400 4405 4410 
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Phe Arg lie Ser Pro Lys Ser Lys Glu Lys Ser Lys Ser Cys Phe 
4415 4420 4425 

Asp Leu Arg Asn Phe Gly Phe Gly Asp Lys Asp Met Leu Val Cys 
4430 4435 4440 

Asn Ala Ala Ser Pro Ala Gly Ala Thr Ser Ala Ser Gin Lys Asn 
4445 4450 4455 

His Ser Gin Glu Tyr Leu Asn Thr Thr Asn Asn Ser Arg Tyr Arg 
4460 4465 4470 

Lys Gin Thr Asn Thr Ala Lys Pro Lys Pro Glu Ser Phe Ser Ser 
4475 4480 4485 

Ser Ser Pro Gin Leu Tyr lie His Lys Pro His His Leu Ala Ala 
4490 4495 4500 

Ala His Pro Ser Ala Leu Asp Asp Gin Thr Pro Pro Pro lie Pro 
4505 4510 4515 

Pro Leu Pro Leu Asn Tyr Gin Arg Ser Asp Asp Glu Ser Tyr Ala 
452 0 4525 " 453 0 

Asn Glu Thr Arg Glu His Lys Lys Gin Arg Ala lie Ser Lys Ala 
4535 4540 4545 

Ser Arg Gin Ala Glu Leu Lys Arg Leu Arg lie Ala Gin Glu lie 
4550 4555 4560 

Gin Arg Glu Gin Glu Glu lie Glu Val Gin Leu Lys Asp Leu Glu 
4565 4570 4575 

Ala Arg Gly Val Leu He Glu Lys Ala Leu Arg Gly Glu Ala Gin 
4580 4585 4590 

Asn He Glu Asn Leu Asp Ala Thr Lys Asp Asn Asp Glu Lys Leu 
4595 4600 4605 

Leu Lys Glu Leu Leu Glu He Trp Arg Asn He Thr Ala Leu Lys 
4610 4615 4620 

Lys Arg Asp Glu Glu Leu Thr He Arg Gin Gin Glu Leu Gin Leu 
4625 4630 4635 



Glu Tyr Arg His Ala Gin Leu Lys Glu Glu Leu Asn Leu Arg Leu 
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4640 4645 4650 

Ser Cys Asn Lys Leu Asp Lys Ser Ser Ala Asp Val Ala Ala Glu 
4655 4660 4665 

Gly Ala lie Leu Asn Glu Met Leu Glu lie Val Ala Lys Arg Ala 
4670 4675 4680 

Ala Leu Arg Pro Thr Ala Ser Gin Leu Asp Leu Thr Ala Ala Gly 
4685 4690 4695 

Ser Ala Ser Thr Ser Ala Glu Ala Thr Gly He Lys Leu Thr Gly 
4700 4705 ~ L 4710 

Gin Pro His Asp His Glu Glu Ser He lie 



4715 




4720 








<210> 9 
<211> 9009 
<212> DNA 
<213> Drosophila 










<400> 9 
atgagccgcc 


aacaccagcg 


gcaccaccag cagcatcacc 


acctgccgcc 


gcaccagcaa 


60 


ccgcagcagc 


agatgccgca 


acaacagcag cagctgacgg 


cgcagcagca 


gcaacaacag 


120 


cagctgctga 


tggcggagca 


cgcggcggcc gcggaggcgg 


cggagctatt 


cgacctgctg 


180 


tgcgtggcca 


caacgatgcg ccagatcctg gcgctccatc gggccatgtg cgaggctgtg 


240 


ggattgagac 


cctcgcctct 


gaacgacttc tacccacggc 


taaaggccaa 


ggtgcgttcg 


300 


tggaaggcgc 


aggccctgtg 


gaagaagttc gacgccagag 


ctgcccatag 


agtctacggc 


360 


aagggagctg 


cctgtactgg 


cacacgcgtc ctggtcatcg 


gagcagggcc 


ctgtggactg 


420 


cgcaccgcca 


tcgaggccca 


actgctgggc gccaaggtgg 


tggtgctgga 


gaaacgcgat 


480 


cgcatcaccc 


ggaacaatgt gctccatctg tggccattcg tcatcacgga tctgcgcaac 


540 


ttgggcgcaa 


agaagttcta 


cggcaagttt tgcgccggct 


ccatcgatca 


catctccatt 


600 


cggcagctgc 


agtgcatgct 


gctcaaggtg gcgctgctcc 


tgggcgtaga 


gatccacgag 


660 


ggagtcagtt 


ttgatcacgc 


tgtagagccc tctggcgatg gcggcggatg gagggcagct 


720 


gttactcccg 


cagatcatcc 


tgtatctcac tacgaattcg 


atgtgttgat 


cggagcggat 


780 


ggcaagcgga 


atatgctgga ctttaggagg aaggagttcc 


gcgggaagct 


ggccatcgct 


840 


attacagcga 


actttatcaa 


caagaagacg gaggcggagg 


ctaaagtaga 


ggagatcagt 


900 


ggggtggctt 


tcatcttcaa 


ccaggccttc ttcaaggagc 


tgtacgggaa 


gacgggcatc 


960 
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gacctggaaa acatcgtcta ctacaaggac gagacgcact acttcgtgat gacggccaag 102 0 

aagcacagtc taattgacaa gggcgttatt atcgaggata tggccgatcc cggcgagctt 108 0 

ctcgccccag ccaatgtgga tacacaaaag ctgcacgact atgcacgcga ggctgcggag 114 0 

ttctccaccc aataccaaat gccaaacctg gagttcgctg ttaatcacta cggcaaacca 12 0 0 

gatgtggcca tgttcgactt cacatcgatg tttgccgccg agatgtcctg tcgggtgatt 1260 

gtgcgcaaag gagctcgcct gatgcagtgc ctcgtgggtg acagtctgct cgagccgttt 132 0 

tggcccactg gatcgggttg tgcccgtgga ttcttatcca gcatggatgc tgcctatgcc 13 80 

atcaagcttt ggtccaaccc gcagaacagc acacttggcg ttctggcgca gcgcgaaagc 144 0 

atctaccggc tgcttaacca gaccacgccg gacaccctgc agcgggacat cagtgcctat 1500 

accgtggatc cggccacgcg ctatccgaat ctgaacaggg agtcggtcaa tagctggcag 1560 

gtcaaacatc tggtcgacac ggacgacccg tccattctgg agcagacctt catggacacg 162 0 

catgctctgc agaccccgca tttggacaca ccgggcagac gcaagcgacg cagtggagac 168 0 

ttgctgcccc agggtgccac gttgctgaga tggataagtg cccagctgca ttcctatcag 1740 

tttattcccg aactcaagga ggcttcggat gtgttccgga atggacgcgt tctgtgtgcg 1800 

cttatcaatc gctatcgtcc tgatctcatc gactacgctg ccaccaagga catgagtccc 1860 

gtggagtgca atgagctgtc attcgccgtc ctagagcgcg aactccacat cgatcgcgtc 192 0 

atgagtgcca aacagtcgct ggacttgacc gagctggagt cgcgaatctg gctcaactat 198 0 

ttggaccaga tctgcgactt gtttcgcggc gagatccccc atatcaagca ccccaagatg 2040 

gactttagcg atttgcgcca gaagtatcgt atcaaccata cgcatgccca acccgacttc 2100 

tccaagctgc tggcaacgaa acccaaggcc aagtcgccga tgcaggatgc tgtggacata 2160 

cccacgacag tgcagcggcg ctcggtgctc gaggaggagc gagccaagcg gcagcgtcgc 222 0 

cacgagcagc ttcttaacat cggtggaggg gcagcaggag ccgccgccgg agttgccgga 2280 

agcgggacag gaaccacaac gcagggtcaa aacgatacgc cacgccggtc caagaagcgc 2340 

cgtcaggttg acaaaaccgc caatattgag gagcgccagc agcgcttgca ggagatcgag 2400 

gagaatcggc aggagcggat gagcaagcgg cgccagcagc gctgtcacca gacgcagaat 2460 

ttctacaaga gccttcagct cctgcaggcg ggcaagctct tgagggaggg tggtgaggcg 2520 

ggagtggccg aggatggcac cccattcgag gactactcga tattcctcta ccgccagcag 2580 

gcgcccgtat tcaatgatcg cgtcaaggac ctcgagcgaa agcttctgtt tcccgatcgc 2 64 0 

gaacggggag atattccgtc ggcattgccg cgcacggcgg acgagcagtt cagcgatcgg 2700 

attaaaaaca tggagcagcg gatgacggga cgtggtggcc ttggtggtga caagaagccc 2760 

aaggatctga tgcgggctat cggcaagatc gactcgaacg attggaatgt acgcgagatc 2820 
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gagaagaaga 


tcgagctgtc 


gaagaagacg 


gagatccacg 


ggcctaaggg 


ccgcgagaag 


2880 


gttcccaagt 


ggagcaagga 


gcagttccag 


gcgcgacagc 


acaagatgtc 


caagccgcaa 


2940 


cgccaggatt 


cgcgtgaggc 


ggaaaagttc 


aaggacatcg 


accagactat 


ccgcaacttg 


3000 


gacaagcagc 


taaaggaggg 


ccacaatctg 


gatgtgggcg 


agcggggacg 


caacaaagtg 


3060 


gcctccattg 


ccggtcagtt 


tggcaaaaag 


gatgaggcca 


attcggatga 


gaagaacgcg 


3120 


ggcagcagca 


atgccaccac 


caacaccaac 


aacacagtca 


tacccaaatc 


tagttccaag 


3180 


gtggcgctgg 


cctttaaaaa 


gcaggctgcc 


tccgaaaagt 


gccgcttctg 


taagcaaacc 


3240 


gtctacctga 


tggagaagac 


caccgtggag 


ggattggttc 


tgcatcgcaa 


ttgccttaag 


3300 


tgccaccact 


gccacaccaa 


cttgcgtctg 


ggaggctacg 


cctttgatcg 


ggacgatccg 


3360 


cagggccgat 


tttactgcac 


ccaacacttc 


cggttgccac 


ccaaaccgct 


gccgcagcgc 


3420 


accaacaaag 


ccaggaaatc 


cgctgccgct 


caacccgcct 


cgcctgctgt 


accaccaact 


3480 


gcgggatccg 


tacccactgc 


agctgccaca 


tcggagcata 


tggacaccac 


tccacccagg 


3540 


gaccaggtgg 


acctactgga 


gacctcgcga 


gcaaatgcct 


ctgccgatgc 


catgtccgat 


3600 


gatgaagcca 


atgttatcga 


tgagcacgaa 


tggtctggtc 


gcaacttctt 


gcccgagtcc 


3660 


aacaacgatt 


cccaatcaga 


gctatccagt 


tcagatgagt 


cggatacgga 


atcggattcg 


3720 


gagatgtttg 


aggaggcgga 


tgattcgccg 


tttggtgctc 


agaccctcca 


gctggcgtcg 


3780 


gattggattg 


gaaagcaata 


ctgtgaggac 


agtgatgatt 


ctgacgattt 


ctacgactca 


3840 


agtgaaggta 


ttgcggatga 


cggcaaagat 


gacaccgagg 


gtgaggaatt 


caagaaggcc 


3900 


cgcgaattga 


ggcgccagga 


agttcgcctg 


cagccgttgc 


ccgccaatct 


gcccacagat 


3960 


acggagaccg 


aggttcaaac 


cgagtccgag 


agcacttcac 


ccgacgaagt 


ggagctcaat 


4020 


tctgccactg 


agatatccac 


cgactctgag 


tttgacaacg 


atgagattat 


acgccaggcg 


4080 


cccaaaatct 


tcatcgatga 


cacccatcta 


aggaagccca 


ccaaggttca 


gatcaagtcc 


4140 


accatgatcg 


gacccaatgc 


agcttccgcc 


ggactccatc 


agaagcagtt 


ggcggcgcgt 


4200 


gagaagggcg 


gcagctacct 


ccagaagtac 


caaccacaac 


cgccactgtc 


acagtttaaa 


4260 


ccgttggtcc 


aggtggatcc 


caccctgctc 


attggcagcc 


agcgcgctcc 


tcttcagaat 


4320 


ccacggccag 


gagactactt 


gctaaacaag 


acggccagta 


cggagggtat 


cgcctcaaaa 


4380 


aagagcctgg 


agctaaaaaa 


gcgctatctg 


ctgggtgagc 


cggccaatgg 


cgataagatc 


4440 


cagaagtccg 


gatccacttc 


agtgctggat 


tcacgcattc 


gcagcttcca 


gtcgaacata 


4500 


tcggagtgcc 


agaagctttt 


gaatcccagc 


agcgacataa 


gtgccggcat 


gcgaaccttc 


4560 


ctcgatcgca 


caaagttggg 


cgaaggcagc 


cagacgacac 


ccggacagac 


gaacgaacta 


4620 
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atccgttccg ccaccagcaa tgtgattaac gatctgcgcg tggagcttcg gatacagaaa 


4680 


actgactcca 


gccactccac 


ggacaacgag aaggaaaacg 


ttttcgtgaa 


ctgtaagaac 


4740 


gagctgaaca 


aggggatgga 


atacacggat gcggtcaatg 


ccacgctgct 


ggaccagctg 


4800 


gccagaaaaa 


gttcacccac 


cacgccgacg aataagacgg 


tggtcgaggt 


tattgacctg 


4860 


gttacacctg 


agaagccaat 


tgacattatc gatctaacgg 


cactggaaac 


gccgaaaaag 


4920 


cagttggtcg 


atggtagcgc 


catggatgta gatgaacgcc 


tcacacccga 


tagcaacaaa 


4980 


atcagcgaac 


tgcagcagga 


agtgaaggag gaacccaagc 


cggatgtctc 


tagggatgtg 


5040 


aaagaatgca 


taccagatat 


actgggacac attaaggagg 


gaacgggatc 


gaaggagcca 


5100 


ggtggagagg 


accaacagag 


cctgctggag cagtcggacg 


aagagaagcg 


cgactcaccg 


5160 


gaaaaggatg 


tggccgaaca 


tgagctttat gaaccggaca 


gtgtgcagat 


ccaggtgccc 


5220 


aatatcccat 


gggaaaaaag 


caagccggag gtcatgtcta 


ccaccggcag 


cagtggctcc 


5280 


atctgctcaa 


gctcagactc 


ttctagtatt gaagacatcc 


agcactacat 


tttggagtcc 


5340 


acaactagtc 


cagatactca 


gacagttggc ggaaagcaca 


atgtgccccg 


tttggaggtg 


5400 


cacgacacaa 


gtggtgccct 


gatgcaggtg gacagcctga 


tgattgtgaa 


cggaaagtat 


5460 


attggggatc 


ccgaggatgt 


caagttcttg gatatgccgg 


ccaatgttat 


tgttccgcca 


5520 


gcaccggcgc 


ttaaaacgaa 


tgagctggat atggaggatg 


accaagaggc 


ggaggcggaa 


5580 


ccagtaactg ctactccgga gccggtggaa tgtacggtca 


tcgaggctga 


gcgccgtgtt 


5640 


actgctcccc 


ctcctttgcc 


agagatgggt ccacccaaac 


tgaagttcga 


tagcaaaaat 


5700 


gagaacaaga 


tcgagagctt 


gaagaatctt ccgttgatcg 


tagagagcaa 


tgtggagcac 


5760 


agtcaggcag 


tgaaacccat 


tactcttaac ttaagcaatc 


tggccaggac 


gccggataca 


5820 


ccaaccacgc 


ccacggcgca 


cgatagcgat aaaacaccca 


ctggggaaat 


tctctcgcga 


5880 


ggatctgact 


cagaaaccga 


gcacactggc actggtcagg 


tactaacgga 


gacggaactc 


5940 


tccgactgga 


cggccgacga 


ctgtatctcg gagaactttg 


ttgacttgga 


gttcgcgctt 


6000 


aactctaaca 


agggtacgat 


aaaacggcgc aaggatcgac 


gacgcagtgg 


agcaagcaaa 


6060 


cttcccagtg 


gcaacgaggt 


aatccacgag ctggccaggc 


aggcgccagt 


ggtgcaaatg 


6120 


gatggaattc 


ttagtgccat 


cgacattgat gacatagagt 


tcatggacac 


gggttcggag 


6180 


ggttcttgtg 


ctgaagctta 


tcccgcaaca aatacagctc 


tcattcagaa 


tagaggttac 


6240 


atggagtaca 


tcgaggcgga gccgaaaaag acgacccgca 


aggcagctcc 


accatccagt 


6300 


tacccaggaa atttaccgcc 


tttaatgacg aagcgggacg 


agaaactggg 


cgttgattac 


6360 


attgagcagg 


gggcgtacat 


aatgcacgat gatgcaaaga 


cgcctgtgaa 


tgaggtggct 


6420 


cctgccatga 


cccagtcgct 


aactgactca atcacgctca 


atgaactgga 


tgatgacagt 


6480 
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atgataatat 


cccaaaccca 


gccaacgaca 


acggaggaaa 


gtgaggcact 


gacggtggtc 


6540 


accagtccac 


ttgacacgtc 


ctcgcccagg 


gttctcgatc 


aatttgcatc 


catgttggcg 


6600 


gcgggaaaag 


gtgactccac 


acccagtagc 


tcagagcaac 


aaccaaagac 


gtctacggtg 


6660 


acgagcagca 


gcactgggcc 


caactcctcg 


acaacaggaa 


acgtctcgaa 


ggagccgcag 


6720 


gaggaggacc 


tgcaaatcca 


gtttgagtat 


gttcgagcac 


tgcagcagcg 


gatatcgcag 


6780 


. atcagcaccc 


aacggcgtaa 


gagctctaag 


ggagaggcac 


ctaacctgca 


gctaaacagt 


6840 


agcgcacctg 


tgatagaatc 


agccgaggat 


ccggccaagc 


ccgcagagga 


gcctctggtc 


6900 


tcaatgcgac 


cgcggaccac 


cagcatttcc 


ggaaaggtac 


cggagatacc 


cacacttagc 


6960 


agcaagctgg 


aagagataac 


caaagaacgc 


actaagcaaa 


aggatctgat 


tcacgaccta 


7020 


gtcatggaca 


agttgcagtc 


gaagaagcag 


ctaaacgctg 


agaagcgtct 


gcaccggagt 


7080 


cgacagcgca 


gtttgctgac 


cagtggctat 


gccagtggct 


ccagccttag 


tccgacgccc 


7140 


aagctggctg 


ctgcttgcag 


tccgcaggat 


tccaactgct 


ctagccaagc 


gcactaccac 


7200 


gcctccacgg 


cggaggaggc 


cccgaagccg 


ccggcggaaa 


ggccgttgca 


gaagtccgcc 


7260 


acgtccacct 


atgtgtcgcc 


ttatcgcact 


gtccaagcgc 


ccacacgtag tgctgatctc 


7320 


tataagccgc 


gccccttcag 


cgaacacatc 


gattcgaacg 


ctctggcggg 


ttacaagctc 


7380 


ggcaagacgg 


cctcgtttaa 


tggcggcaag 


ttgggcgact 


ttgcgaaacc 


cattgccccg 


7440 


gcgagagtta 


accgaggagg 


a ggtgtcgcg 


accgcggata 


tagccaatat 


ttccgcgtcg 


7500 


acggagaacc 


taagaagcga 


ggccagggcc 


agggctcgtc 


ttaagtctaa 


cacagagctg 


7560 


ggccttagtc 


ccgaggaaaa 


gatgcagcta 


atacgttcaa 


gattgcacta 


cgaccaaaac 


7620 


agatctctga 


agccgaagca 


actggaggag 


atgccatccg gggatctggc 


ggcacgtgcc 


7680 


cgcaaaatga 


gtgcctcgaa 


gagcgtcaat 


gatctggcct 


acatggtggg 


acagcagcag 


7740 


cagcagcagg 


ttgagaagga 


tgccgtgctc 


caagccaagg 


cggctgactt 


tacatccgat 


7800 


cccaatttgg 


cgtccggtgg 


tcaggagaag 


gcaggcaaaa 


ctaagtccgg 


acgcaggcca 


7860 


aaggatccgg 


agcggcgtaa 


gagtctcata 


cagtcgttgt 


ccagcttctt 


ccaaaaggga 


7920 


tctggatccg 


cggcctccag 


ttccaaggag 


cagggcggcg 


ctgtggctgc 


cgtccactct 


7980 


gaacagtcag 


agcgaccagg 


caccagcagc 


agcggcacgc 


ccacaatatc 


ggatgcggcg 


8040 


ggtggaggcg 


gaggaggagg 


tggcgtcttc 


agcagattcc 


gcatctcgcc 


caagtccaag 


8100 


gagaagtcaa 


agtcttgctt 


tgatctgagg 


aatttcggtt 


ttggtgacaa ggatatgctg 


8160 


gtctgcaatg 


cagcatctcc 


agcaggagcc 


acatccgcat 


cacagaaaaa 


tcactcgcaa 


8220 


gagtatctga 


acaccacgaa 


caacagtcgc 


tatcgaaagc 


aaacgaacac 


tgcgaaaccg 


8280 
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aaacccgaat 


cgttctcttc 


atccagtccg 


cagctctata 


tacacaaarr 






gccgcagctc 


atcccagtgc 


cctggacgac 


cagacaccac 


cacccaf app 




O^tUU 


ctgaattatc 


agagatccga 


t q a fc cracr acr c 

ZD ZD ^^ZD ZD 


tacgctaacg 






O A £?C\ 
O^fc O U 


caacgtgcca 


tatcgaaggc 


ttcacgacaa 


crctcracrcfc ca 


acrncra 1" t* cirri 


t3 =a ■}- rTY a 


peon 


gagattcagc 


gggaacagga 


ggagatcgag 


gtgcaactga 


acr era t c fc crcr a 


Cf cr r 1 a f cr r 1 crrr n 


QCQA 
O 3 O U 


gtgcttattg 


agaaggcctt 


gcgaggcgag 


gcgcagaata 


ttcraaaacrt 


crcra t" crr^cr;^ na 


OD1U 


aaggacaacg 


acgagaagct 


acttaaggaa 


cttttggaga 


tttggcgcaa 


catoarsopa 

W. * — C* Vw^ CL 


R70D 


ctcaagaaac 


gcgatgagga actgactata 


aQcrcaacacro 

W *"C3 ZD — ^3 71 


aac 1" err 1 a a c fc 




O / D U 


catgcccagc 


tgaaggaaga 


gctcaatctg 


cact" i~crt" rrf 




y y d (_ cici ciciy c 


D Q o n 


tctgccgatg 


tggccgccga 


gggagcaatt 


ctcaacgaga 


tgctggaaat 


tgtcgccaag 


8880 


cgagccgccc 


tacgacccac 


agcctcccag 


ctcgacctca 


eggcageggg 


atcagcatcc 


8940 


acgtccgccg 


aggcaacggg cattaagctg 


acgggacaac 


cgcatgacca 


cgaagaatcg 


9000 


atcatttga 












9009 



<210> 10 

<211> 3002 

<212> PRT 

<213> Drosophila 

<400> 10 



Met Ser Arg Gin His Gin Arg His His Gin Gin His His His Leu Pro 
15 10 15 



Pro His Gin Gin Pro Gin Gin Gin Met Pro Gin Gin Gin Gin Gin Leu 
20 25 30 



Thr Ala Gin Gin Gin Gin Gin Gin Gin Leu Leu Met Ala Glu His Ala 
35 40 45 



Ala Ala Ala Glu Ala Ala Glu Leu Phe Asp Leu Leu Cys Val Ala Thr 
50 55 60 



Thr Met Arg Gin lie Leu Ala Leu His Arg Ala Met Cys Glu Ala Val 
65 VO 75 ~ 80 



Gly Leu Arg Pro Ser Pro Leu Asn Asp Phe Tyr Pro Arg Leu Lys Ala 
85 90 95 



Lys Val Arg Ser Trp Lys Ala Gin Ala Leu Trp Lys Lys Phe Asp Ala 
100 105 "* " 110 
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Arg Ala Ala His Arg Val Tyr Gly Lys Gly Ala Ala Cys Thr Gly Thr 
115 120 125 



Arg Val Leu Val lie Gly Ala Gly Pro Cys Gly Leu Arg Thr Ala lie 
130 135 140 



Glu Ala Gin Leu Leu Gly Ala Lys Val Val Val Leu Glu Lys Arg Asp 
145 150 155 160 



Arg lie Thr Arg Asn Asn Val Leu His Leu Trp Pro Phe Val lie Thr 
165 170 175 



Asp Leu Arg Asn Leu Gly Ala Lys Lys Phe Tyr Gly Lys Phe Cys Ala 
180 185 190 



Gly Ser lie Asp His lie Ser lie Arg Gin Leu Gin Cys Met Leu Leu 
195 200 205 



Lys Val Ala Leu Leu Leu Gly Val Glu lie His Glu Gly Val Ser Phe 
210 215 220 



Asp His Ala Val Glu Pro Ser Gly Asp Gly Gly Gly Trp Arg Ala Ala 
225 230 235 240 



Val Thr Pro Ala Asp His Pro Val Ser His Tyr Glu Phe Asp Val Leu 
245 250 255 



lie Gly Ala Asp Gly Lys Arg Asn Met Leu Asp Phe Arg Arg Lys Glu 
260 265 270 



Phe Arg Gly Lys Leu Ala lie Ala lie Thr Ala Asn Phe lie Asn Lys 
275 280 285 



Lys Thr Glu Ala Glu Ala Lys Val Glu Glu He Ser Gly Val Ala Phe 
290 295 300 



He Phe Asn Gin Ala Phe Phe Lys Glu Leu Tyr Gly Lys Thr Gly He 
305 310 315 320 



Asp Leu Glu Asn He Val Tyr Tyr Lys Asp Glu Thr His Tyr Phe Val 
325 330 335 



Met Thr Ala Lys Lys His Ser Leu He Asp Lys Gly Val He He Glu 
340 345 350 
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Asp Met Ala Asp Pro Gly Glu Leu Leu Ala Pro Ala Asn Val Asp Thr 
355 360 365 

Gin Lys Leu His Asp Tyr Ala Arg Glu Ala Ala Glu Phe Ser Thr Gin 
370 375 380 

Tyr Gin Met Pro Asn Leu Glu Phe Ala Val Asn His Tyr Gly Lys Pro 
385 390 395 400 

Asp Val Ala Met Phe Asp Phe Thr Ser Met Phe Ala Ala Glu Met Ser 
405 410 415 

Cys Arg Val lie Val Arg Lys Gly Ala Arg Leu Met Gin Cys Leu Val 
420 425 430 

Gly Asp Ser Leu Leu Glu Pro Phe Trp Pro Thr Gly Ser Gly Cys Ala 
435 440 445 

Arg Gly Phe Leu Ser Ser Met Asp Ala Ala Tyr Ala lie Lys Leu Trp 
450 455 460 

Ser Asn Pro Gin Asn Ser Thr Leu Gly Val Leu Ala Gin Arg Glu Ser 
465 470 475 480 

He Tyr Arg Leu Leu Asn Gin Thr Thr Pro Asp Thr Leu Gin Arg Asp 
485 490 4 95 

He Ser Ala Tyr Thr Val Asp Pro Ala Thr Arg Tyr Pro Asn Leu Asn 
500 505 510 

Arg Glu Ser Val Asn Ser Trp Gin Val Lys His Leu Val Asp Thr Asp 
515 520 525 

Asp Pro Ser He Leu Glu Gin Thr Phe Met Asp Thr His Ala Leu Gin 
530 535 540 

Thr Pro His Leu Asp Thr Pro Gly Arg Arg Lys Arg Arg Ser Gly Asp 
545 550 555 560 

Leu Leu Pro Gin Gly Ala Thr Leu Leu Arg Trp He Ser Ala Gin Leu 
565 570 575 

His Ser Tyr Gin Phe He Pro Glu Leu Lys Glu Ala Ser Asp Val Phe 
580 585 590 



Arg Asn Gly Arg Val Leu Cys Ala Leu He Asn Arg Tyr Arg Pro Asp 
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595 . 600 605 



Leu lie Asp Tyr Ala Ala Thr Lys Asp Met Ser Pro Val Glu Cys Asn 
610 615 620 



Glu Leu Ser Phe Ala Val Leu Glu Arg Glu Leu His lie Asp Arg Val 
625 630 635 ~ 640 



Met Ser Ala Lys Gin Ser Leu Asp Leu Thr Glu Leu Glu Ser Arg He 
645 650 655 



Trp Leu Asn Tyr Leu Asp Gin He Cys Asp Leu Phe Arg Gly Glu He 
660 665 670 



Pro His He Lys His Pro Lys Met Asp Phe Ser Asp Leu Arg Gin Lys 
675 680 685 



Tyr Arg He Asn His Thr His Ala Gin Pro Asp Phe Ser Lys Leu Leu 
690 695 700 



Ala Thr Lys Pro Lys Ala Lys Ser Pro Met Gin Asp Ala Val Asp He 
705 710 715 720 



Pro Thr Thr Val Gin Arg Arg Ser Val Leu Glu Glu Glu Arg Ala Lys 
725 730 735 



Arg Gin Arg Arg His Glu Gin Leu Leu Asn He Gly Gly Gly Ala Ala 
740 745 750 



Gly Ala Ala Ala Gly Val Ala Gly Ser Gly Thr Gly Thr Thr Thr Gin 
755 760 765 



Gly Gin Asn Asp Thr Pro Arg Arg Ser Lys Lys Arg Arg Gin Val Asp 
770 775 780 



Lys Thr Ala Asn lie Glu Glu Arg Gin Gin Arg Leu Gin Glu He Glu 
785 790 795 800 



Glu Asn Arg Gin Glu Arg Met Ser Lys Arg Arg Gin Gin Arg Cys His 
805 810 815 



Gin Thr Gin Asn Phe Tyr Lys Ser Leu Gin Leu Leu Gin Ala Gly Lys 
820 825 830 



Leu Leu Arg Glu Gly Gly Glu Ala Gly Val Ala Glu Asp Gly Thr Pro 
835 840 845 
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Phe Glu Asp Tyr Ser lie Phe Leu Tyr Arg Gin Gin Ala Pro Val Phe 
850 855 860 



Asn Asp Arg Val Lys Asp Leu Glu Arg Lys Leu Leu Phe Pro Asp Arg 
865 870 875 880 



Glu Arg Gly Asp lie Pro Ser Ala Leu Pro Arg Thr Ala Asp Glu Gin 
885 890 895 



Phe Ser Asp Arg He Lys Asn Met Glu Gin Arg Met Thr Gly Arg Gly 
900 905 910 



Gly Leu Gly Gly Asp Lys Lys Pro Lys Asp Leu Met Arg Ala He Gly 
915 920 925 



Lys He Asp Ser Asn Asp Trp Asn Val Arg Glu He Glu Lys Lys He 
930 935 940 



Glu Leu Ser Lys Lys Thr Glu He His Gly Pro Lys Gly Arg Glu Lys 
945 950 955 960 



Val Pro Lys Trp Ser Lys Glu Gin Phe Gin Ala Arg Gin His Lys Met 
965 970 ' 975 



Ser Lys Pro Gin Arg Gin Asp Ser Arg Glu Ala Glu Lys Phe Lys Asp 
980 985 990 



He Asp Gin Thr He Arg Asn Leu Asp Lys Gin Leu Lys Glu Gly His 
995 1000 1005 



Asn Leu Asp Val Gly Glu Arg Gly Arg Asn Lys Val Ala Ser He 
1010 1015 1020 



Ala Gly Gin Phe Gly Lys Lys Asp Glu Ala Asn Ser Asp Glu Lys 
1025 103 0 1035 



Asn Ala Gly Ser Ser Asn Ala Thr Thr Asn Thr Asn Asn Thr Val 
1040 1045 1050 



He Pro Lys Ser Ser Ser Lys Val Ala Leu Ala Phe Lys Lys Gin 
1055 1060 1065 



Ala Ala Ser Glu Lys Cys Arg Phe Cys Lys Gin Thr Val Tyr Leu 
1070 1075 1080 
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Met Glu Lys Thr Thr Val Glu Gly Leu Val Leu His Arg Asn Cys 
1085 1090 1095 



Leu Lys Cys His His Cys His Thr Asn Leu Arg Leu Gly Gly Tyr 
1100 1105 ~ 1110 



Ala Phe Asp Arg Asp Asp Pro Gin Gly Arg Phe Tyr Cys Thr Gin 
1115 1120 1125 



His Phe Arg Leu Pro Pro Lys Pro Leu Pro Gin Arg Thr Asn Lys 
1130 1135 1140 



Ala Arg Lys Ser Ala Ala Ala Gin Pro Ala Ser Pro Ala Val Pro 
1145 1150 1155 



Pro Thr Ala Gly Ser Val Pro Thr Ala Ala Ala Thr Ser Glu His 
1160 1165 1170 



Met Asp Thr Thr Pro Pro Arg Asp Gin Val Asp Leu Leu Gin Thr 
1175 1180 1185 



Ser Arg Ala Asn Ala Ser Ala Asp Ala Met Ser Asp Asp Glu Ala 
1190 1195 1200 



Asn Val He Asp Glu His Glu Trp Ser Gly Arg Asn Phe Leu Pro 
1205 1210 1215 



Glu Ser Asn Asn Asp Ser Gin Ser Glu Leu Ser Ser Ser Asp Glu 
1220 1225 1230 



Ser Asp Thr Glu Ser Asp Ser Glu Met Phe Glu Glu Ala Asp Asp 
1235 1240 1245 



Ser Pro Phe Gly Ala Gin Thr Leu Gin Leu Ala Ser Asp Trp He 
1250 1255 1260 



Gly Lys Gin Tyr Cys Glu Asp Ser Asp Asp Ser Asp Asp Phe Tyr 
1265 1270 1275 



Asp Ser Ser Glu Gly He Ala Asp Asp Gly Lys Asp Asp Thr Glu 
1280 1285 " 1290 



Gly Glu Glu Phe Lys Lys Ala Arg Glu Leu Arg Arg Gin Glu Val 
1295 1300 1305 
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Arg Leu Gin Pro Leu Pro Ala Asn Leu Pro Thr Asp Thr Glu Thr 
1310 1315 1320 



Glu Val Gin Thr Glu Ser Glu Ser Thr Ser Pro Asp Glu Val Glu 
1325 1330 1335 



Leu Asn Ser Ala Thr Glu lie Ser Thr Asp Ser Glu Phe Asp Asn 
1340 1345 1350 



Asp Glu lie lie Arg Gin Ala Pro Lys lie Phe lie Asp Asp Thr 
1355 1360 1365 



His Leu Arg Lys Pro Thr Lys Val Gin lie Lys Ser Thr Met lie 
1370 1375 1380 



Gly Pro Asn Ala Ala Ser Ala Gly Leu His Gin Lys Gin Leu Ala 
1385 1390 1395 



Ala Arg Glu Lys Gly Gly Ser Tyr Leu Gin Lys Tyr Gin Pro Gin 
1400 1405 1410 



Pro Pro Leu Ser Gin Phe Lys Pro Leu Val Gin Val Asp Pro Thr 
1415 1420 1425 



Leu Leu lie Gly Ser Gin Arg Ala Pro Leu Gin Asn Pro Arg Pro 
1430 1435 1440 



Gly Asp Tyr Leu Leu Asn Lys Thr Ala Ser Thr Glu Gly lie Ala 
1445 1450 1455 



Ser Lys Lys Ser Leu Glu Leu Lys Lys Arg Tyr Leu Leu Gly Glu 
1460 1465 1470 



Pro Ala Asn Gly Asp Lys He Gin Lys Ser Gly Ser Thr Ser Val 
1475 1480 1485 



Leu Asp Ser Arg He Arg Ser Phe Gin Ser Asn He Ser Glu Cys 
1490 1495 1500 



Gin Lys Leu Leu Asn Pro Ser Ser Asp He Ser Ala Gly Met Arg 
1505 1510 1515 



Thr Phe Leu Asp Arg Thr Lys Leu Gly Glu Gly Ser Gin Thr Thr 
1520 1525 1530 



Pro Gly Gin Thr Asn Glu Leu He Arg Ser Ala Thr Ser Asn Val 
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1535 1540 1545 

lie Asn Asp Leu Arg Val Glu Leu Arg lie Gin Lys Thr Asp Ser 
1550 1555 1560 

Ser His Ser Thr Asp Asn Glu Lys Glu Asn Val Phe Val Asn Cys 
1565 1570 1575 

Lys Asn Glu Leu Asn Lys Gly Met Glu Tyr Thr Asp Ala Val Asn 
1580 1585 1590 

Ala Thr Leu Leu Asp Gin Leu Ala Arg Lys Ser Ser Pro Thr Thr 
1595 1600 ~ 1605 

Pro Thr Asn Lys Thr Val Val Glu Val lie Asp Leu Val Thr Pro 
1610 1615 1620 

Glu Lys Pro lie Asp lie He Asp Leu Thr Ala Leu Glu Thr Pro 
1625 1630 1635 

Lys Lys Gin Leu Val Asp Gly Ser Ala Met Asp Val Asp Glu Arg 
1640 1645 1650 

Leu Thr Pro Asp Ser Asn Lys He Ser Glu Leu Gin Gin Glu Val 
1655 1660 1665 

Lys Glu Glu Pro Lys Pro Asp Val Ser Arg Asp Val Lys Glu Cys 
1670 1675 1680 

He Pro Asp lie Leu Gly His He Lys Glu Gly Thr Gly Ser Lys 
1685 1690 1695 

Glu Pro Gly Gly Glu Asp Gin Gin Ser Leu Leu Glu Gin Ser Asp 
1700 1705 1710 

Glu Glu Lys Arg Asp Ser Pro Glu Lys Asp Val Ala Glu His Glu 
1715 1720 1725 

Leu Tyr Glu Pro Asp Ser Val Gin He Gin Val Pro Asn He Pro 
1730 1735 1740 

Trp Glu Lys Ser Lys Pro Glu Val Met Ser Thr Thr Gly Ser Ser 
1745 1750 1755 



Gly Ser He Cys Ser Ser Ser Asp Ser Ser Ser He Glu Asp He 
1760 1765 1770 
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Gin His Tyr He Leu Glu Ser Thr Thr Ser Pro Asp Thr Gin Thr 
1775 1780 1785 



Val Gly Gly Lys His Asn Val Pro Arg Leu Glu Val His Asp Thr 
1790 1795 1800 



Ser Gly Ala Leu Met Gin Val Asp Ser Leu Met He Val Asn Gly 
1805 1810 1815 



Lys Tyr He Gly Asp Pro Glu Asp Val Lys Phe Leu Asp Met Pro 
1820 1825 1830 



Ala Asn Val He Val Pro Pro Ala Pro Ala Leu Lys Thr Asn Glu 
1835 1840 1845 



Leu Asp Met Glu Asp Asp Gin Glu Ala Glu Ala Glu Pro Val Thr 
1850 1855 I860 



Ala Thr Pro Glu Pro Val Glu Cys Thr Val He Glu Ala Glu Arg 
1865 1870 1875 



Arg Val Thr Ala Pro Pro Pro Leu Pro Glu Met Gly Pro Pro Lys 
1880 1885 1890 



Leu Lys Phe Asp Ser Lys Asn Glu Asn Lys He Glu Ser Leu Lys 
1895 1900 1905 



Asn Leu Pro Leu He Val Glu Ser Asn Val Glu His Ser Gin Ala 
1510 1915 1920 



Val Lys Pro He Thr Leu Asn Leu Ser Asn Leu Ala Arg Thr Pro 
1925 1930 1935 



Asp Thr Pro Thr Thr Pro Thr Ala His Asp Ser Asp Lys Thr Pro 
1940 1945 1950 



Thr Gly Glu He Leu Ser Arg Gly Ser Asp Ser Glu Thr Glu His 
1955 I960 1965 



Thr Gly Thr Gly Gin Val Leu Thr Glu Thr Glu Leu Ser Asp Trp 
1970 1975 1980 



Thr Ala Asp Asp Cys He Ser Glu Asn Phe Val Asp Leu Glu Phe 
1985 1990 1995 
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Ala Leu Asn Ser Asn Lys Gly Thr lie Lys Arg Arg Lys Asp Arg 
2000 2005 2010 



Arg Arg Ser Gly Ala Ser Lys Leu Pro Ser Gly Asn Glu Val lie 
2015 2020 2025 



His Glu ' Leu Ala Arg Gin Ala Pro Val Val Gin Met Asp Gly lie 
2030 2035 2040 



Leu Ser Ala lie Asp lie Asp Asp lie Glu Phe Met Asp Thr Gly 
2045 2050 2055 



Ser Glu Gly Ser Cys Ala Glu Ala Tyr Pro Ala Thr Asn Thr Ala 
2060 2065 207.0 



Leu lie Gin Asn Arg Gly Tyr Met Glu Tyr lie Glu Ala Glu Pro 
2075 2080 2085 



Lys Lys Thr Thr Arg Lys Ala Ala Pro Pro Ser Ser Tyr Pro Gly 
2090 2095 2100 



Asn Leu Pro Pro Leu Met Thr Lys Arg Asp Glu Lys Leu Gly Val 
2105 2110 2115 



Asp Tyr lie Glu Gin Gly Ala Tyr lie Met His Asp Asp Ala Lys 
2120 2125 2130 



Thr Pro Val Asn Glu Val Ala Pro Ala Met Thr Gin Ser Leu Thr 
2135 2140 2145 



Asp Ser lie Thr Leu Asn Glu Leu Asp Asp Asp Ser Met lie lie 
2150 2155 2160 



Ser Gin Thr Gin Pro Thr Thr Thr Glu Glu Ser Glu Ala Leu Thr 
2165 2170 2175 



Val Val Thr Ser Pro Leu Asp Thr Ser Ser Pro Arg Val Leu Asp 
2180 2185 2190 



Gin Phe Ala Ser Met Leu Ala Ala Gly Lys Gly Asp Ser Thr Pro 
2195 2200 2205 



Ser Ser Ser Glu Gin Gin Pro Lys Thr Ser Thr Val Thr Ser Ser 
2210 2215 2220 
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Ser Thr Gly Pro Asn Ser Ser Thr Thr Gly Asn Val Ser Lys Glu 
2225 2230 2235 



Pro Gin Glu Glu Asp Leu Gin lie Gin Phe Glu Tyr Val Arg Ala 
2240 2245 2250 



Leu Gin Gin Arg lie Ser Gin lie Ser Thr Gin Arg Arg Lys Ser 
2255 2260 2265 



Ser Lys Gly Glu Ala Pro Asn Leu Gin Leu Asn Ser Ser Ala Pro 
2270 2275 2280 



Val lie Glu Ser Ala Glu Asp Pro Ala Lys Pro Ala Glu Glu Pro 
2285 2290 2295 



Leu Val Ser Met Arg Pro Arg Thr Thr Ser lie Ser Gly Lys Val 
2300 2305 2310 



Pro Glu lie Pro Thr Leu Ser Ser Lys Leu Glu Glu lie Thr Lys 
l 2315 2320 2325 



Glu Arg Thr Lys Gin Lys Asp Leu lie His Asp Leu Val Met Asp 
2330 2335 2340 



Lys Leu Gin Ser Lys Lys Gin Leu Asn Ala Glu Lys Arg Leu His 
2345 2350 2355 



Arg Ser Arg Gin Arg Ser Leu Leu Thr Ser Gly Tyr Ala Ser Gly 
2360 2365 2370 



Ser Ser Leu Ser Pro Thr Pro Lys Leu Ala Ala Ala Cys Ser Pro 
2375 2380 2385 



Gin Asp Ser Asn Cys Ser Ser Gin Ala His Tyr His Ala Ser Thr 
2390 2395 2400 



Ala Glu Glu Ala Pro Lys Pro Pro Ala Glu Arg Pro Leu Gin Lys 
2405 2410 2415 



Ser Ala Thr Ser Thr Tyr Val Ser Pro Tyr Arg Thr Val Gin Ala 
2420 2425 2430 



Pro Thr Arg Ser Ala Asp Leu Tyr Lys Pro Arg Pro Phe Ser Glu 
2435 2440 2445 



His He Asp Ser Asn Ala Leu Ala Gly Tyr Lys Leu Gly Lys Thr 
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2450 2455 2460 



Ala Ser Phe Asn Gly Gly Lys Leu Gly Asp Phe Ala Lys Pro lie 
2465 2470 2475 



Ala Pro Ala Arg Val Asn Arg Gly Gly Gly Val Ala Thr Ala Asp 
2480 2485 2490 



He Ala Asn He Ser Ala Ser Thr Glu Asn Leu Arg Ser Glu Ala 
2495 2500 2505 



Arg Ala Arg Ala Arg Leu Lys Ser Asn Thr Glu Leu Gly Leu Ser 
2510 2515 2520 



Pro Glu Glu Lys Met Gin Leu He Arg Ser Arg Leu His Tyr Asp 
2525 2530 2535 



Gin Asn Arg Ser Leu Lys Pro Lys Gin Leu Glu Glu Met Pro Ser 
2540 2545 2550 



Gly Asp Leu Ala Ala Arg Ala Arg Lys Met Ser Ala Ser Lys Ser 
2555 2560 2565 



Val Asn Asp Leu Ala Tyr Met Val Gly Gin Gin Gin Gin Gin Gin 
2570 2575 2580 



Val Glu Lys Asp Ala Val Leu Gin Ala Lys Ala Ala Asp Phe Thr 
2585 2590 2595 



Ser Asp Pro Asn Leu Ala Ser Gly Gly Gin Glu Lys Ala Gly Lys 
2600 2605 2610 



Thr Lys Ser Gly Arg Arg Pro Lys Asp Pro Glu Arg Arg Lys Ser 
2615 2620 2625 



Leu He Gin Ser Leu Ser Ser Phe Phe Gin Lys Gly Ser Gly Ser 
2630 2635 2640 



Ala Ala Ser Ser Ser Lys Glu Gin Gly Gly Ala Val Ala Ala Val 
2645 2650 2655 



His Ser Glu Gin Ser Glu Arg Pro Gly Thr Ser Ser Ser Gly Thr 
2660 2665 2670 



Pro Thr He Ser Asp Ala Ala Gly Gly Gly Gly Gly Gly Gly Gly 
2675 2680 2685 
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Val Phe Ser Arg Phe Arg lie Ser Pro Lys Ser Lys Glu Lys Ser 
2690 2695 2700 



Lys Ser Cys Phe Asp Leu Arg Asn Phe Gly Phe Gly Asp Lys Asp 
2705 2710 2715 



Met Leu Val Cys Asn Ala Ala Ser Pro Ala Gly Ala Thr Ser Ala 
2720 2725 2730 



Ser Gin Lys Asn His Ser Gin Glu Tyr Leu Asn Thr Thr Asn Asn 
2735 2740 2745 



Ser Arg Tyr Arg Lys Gin Thr Asn Thr Ala Lys Pro Lys Pro Glu 
2750 2755 2760 



Ser Phe Ser Ser Ser Ser Pro Gin Leu Tyr lie His Lys Pro His 
2765 2770 2775 



His Leu Ala Ala Ala His Pro Ser Ala Leu Asp Asp Gin Thr Pro 
2780 2785 2790 



Pro Pro lie Pro Pro Leu Pro Leu Asn Tyr Gin Arg Ser Asp Asp 
2795 2800 2805 



Glu Ser Tyr Ala Asn Glu Thr Arg Glu His Lys Lys Gin Arg Ala 
2810 2815 2820 



lie Ser Lys Ala Ser Arg Gin Ala Glu Leu Lys Arg Leu Arg lie 
2825 2830 2835 



Ala Gin Glu lie Gin Arg Glu Gin Glu Glu lie Glu Val Gin Leu 
2840 2845 2850 



Lys Asp Leu Glu Ala Arg Gly Val Leu lie Glu Lys Ala Leu Arg 
2855 2860 2865 



Gly Glu Ala Gin Asn lie Glu Asn Leu Asp Ala Thr Lys Asp Asn 
2870 2875 2880 



Asp Glu Lys Leu Leu Lys Glu Leu Leu Glu lie Trp Arg Asn lie 
2885 2890 2895 



Thr Ala Leu Lys Lys Arg Asp Glu Glu Leu Thr lie Arg Gin Gin 
2900 2905 2910 
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Glu Leu Gin Leu Glu Tyr Arg His Ala Gin Leu Lys Glu Glu Leu 
2915 2920 2925 

Asn Leu Arg Leu Ser Cys Asn Lys Leu Asp Lys Ser Ser Ala Asp 
2930 2935 2940 

Val Ala Ala Glu Gly Ala lie Leu Asn Glu Met Leu Glu lie Val 
2945 2950 2955 

Ala Lys Arg Ala Ala Leu Arg Pro Thr Ala Ser Gin Leu Asp Leu 
2960 2965 2970 

Thr Ala Ala Gly Ser Ala Ser Thr Ser Ala Glu Ala Thr Gly lie 
2975 2980 2985 

Lys Leu Thr Gly Gin Pro His Asp His Glu Glu Ser lie He 



<210> 11 
<211> 8205 
<212> DNA 
<213> Drosophila 

<400> 11 

atgagccgcc aacaccagcg gcaccaccag cagcatcacc acctgccgcc gcaccagcaa 60 

ccgcagcagc agatgccgca acaacagcag cagctgacgg cgcagcagca gcaacaacag 12 0 

cagctgctga tggcggagca cgcggcggcc gcggaggcgg cggagctatt cgacctgctg 18 0 

tgcgtggcca caacgatgcg ccagatcctg gcgctccatc gggccatgtg cgaggctgtg 24 0 

ggattgagac cctcgcctct gaacgacttc tacccacggc taaaggccaa ggtgcgttcg 3 00 

tggaaggcgc aggccctgtg gaagaagttc gacgccagag ctgcccatag agtctacggc 3 60 

aa gggagctg cctgtactgg cacacgcgtc ctggtcatcg gagcagggcc ctgtggactg 42 0 

cgcaccgcca tcgaggccca actgctgggc gccaaggtgg tggtgctgga gaaacgcgat 48 0 

cgcatcaccc ggaacaatgt gctccatctg tggccattcg tcatcacgga tctgcgcaac 540 

ttgggcgcaa agaagttcta cggcaagttt tgcgccggct ccatcgatca catctccatt 60 0 

cggcagctgc agtgcatgct gctcaaggtg gcgctgctcc tgggcgtaga gatccacgag 660 

ggagtcagtt ttgatcacgc tgtagagccc tctggcgatg gcggcggatg gagggcagct 72 0 

gttactcccg cagatcatcc tgtatctcac tacgaattcg atgtgttgat cggagcggat 78 0 

ggcaagcgga atatgctgga ctttaggagg aaggagttcc gcgggaagct ggccatcgct 84 0 

attacagcga actttatcaa caagaagacg gaggcggagg ctaaagtaga ggagatcagt 9 00 
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ggggtggctt 


tcatcttcaa 


ccaggccttc 


ttcaaggagc 


tgtacgggaa 


gaegggcate 


960 


gacctggaaa 


acategtcta 


ctacaaggac 


gagaegcact 


acttegtgat 


gaeggecaag 


1020 


aagcacagtc 


taattgacaa 


gggcgttatt 


atcgaggata 


tggccgatcc 


eggegagett 


1080 


ctcgccccag 


ccaatgtgga 


tacacaaaag 


ctgcacgact 


atgeacgega 


ggctgeggag 


1140 


ttctccaccc 


aataccaaat 


gccaaacctg 


gagttcgctg 


ttaatcacta 


cggcaaacca 


1200 


gatgtggcca 


tgttcgactt 


cacatcgatg 


tttgccgccg 


agatgtcctg 


tegggtgatt 


1260 


gtgcgcaaag 


gagctcgcct 


gatgcagtgc 


ctcgtgggtg 


acagtctget 


cgagccgttt 


1320 


tggcccactg 


gatcgggttg 


tgcccgtgga 


ttcttatcca 


gcatggatgc 


tgcctatgcc 


1380 


atcaagcttt 


ggtccaaccc 


gcagaacagc 


acacttggcg 


ttctggcgca 


gegegaaage 


1440 


atctaccggc 


tgcttaacca 


gaccacgccg 


gacaccctgc 


agegggacat 


cagtgectat 


1500 


accgtggatc 


cggccacgcg 


ctatccgaat 


ctgaacaggg 


agtcggtcaa 


tagctggcag 


1560 


gtcaaacatc 


tggtcgacac 


ggacgacccg 


tccattctgg 


agcagacctt 


catggacacg 


1620 


catgctctgc 


agaccccgca 


tttggacaca 


ccgggcagac 


geaagegacg 


cagtggagac 


1680 


ttgctgcccc 


agggtgecac 


gttgetgaga 


tggataagtg 


cccagctgca 


ttcctatcag 


1740 


tttattcccg 


aactcaagga 


ggctteggat 


gtgttccgga 


atggacgcgt 


tctgtgtgcg 


1800 


cttatcaatc 


gctatcgtcc 


tgatctcatc 


gaetaegctg 


ccaccaagga 


catgagtccc 


1860 


gtggagtgca 


atgagctgtc 


attcgccgtc 


etagagegeg 


aactccacat 


cgatcgcgtc 


1920 


atgagtgcca 


aacagtcget 


ggacttgacc 


gagctggagt 


cgcgaatctg 


gctcaactat 


1980 


ttggaccaga 


tetgegaett 


gtttcgegge 


gagatccccc 


atatcaagca 


ccccaagatg 


2040 


gactttagcg 


atttgegeca 


gaagtatcgt 


atcaaccata 


cgcatgccca 


acccgacttc 


2100 


tccaagctgc 


tggcaacgaa 


acccaaggcc 


aagtegcega 


tgcaggatgc 


tgtggacata 


2160 


cccacgacag 


tgcagcggcg 


ctcggtgctc 


gaggaggagc 


gagecaageg 


gcagcgtcgc 


2220 


cac gage age 


ttcttaacat 


cggtggaggg 


gcagcaggag 


ccgccgccgg 


agttgccgga 


2280 


agegggacag 


gaaccacaac 


gcagggtcaa 


aacgataege 


cacgccggtc 


caagaagege 


2340 


cgtcaggttg 


acaaaaccgc 


caatattagt 


tccaaggtgg 


cgctggcctt 


taaaaagcag 


2400 


gctgcctccg 


aaaagtgccg 


cttctgtaag 


caaaccgtct 


acctgatgga 


gaagaccacc 


2460 


gtggagggat 


tggttctgca 


tegcaattge 


ettaagtgee 


accactgcca 


caccaacttg 


2520 


cgtctgggag 


gctacgcctt 


tgategggae 


gatcegcagg 


gecgatttta 


ctgcacccaa 


2580 


cacttccggt 


tgccacccaa 


accgctgccg 


cagcgcacca 


acaaagecag 


gaaatccget 


2640 


gccgctcaac 


ccgcctcgcc 


tgctgtacca 


ccaactgcgg 


gatccgtacc 


cactgcagct 


2700 


gccacatcgg 


agcatatgga 


caccactcca 


cccagggacc 


aggtggacct 


actggagacc 


2760 
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tcgcgagcaa 


atgcctctgc 


cgatgccatg 


tccgatgatg 


aagccaatgt 


tatcgatgag 


2820 


cacgaatggt 


ctggtcgcaa 


cttcttgccc 


gagtccaaca 


acgattccca 


atcagagcta 


2880 


tccagttcag 


atgagtcgga 


tacggaatcg 


gattcggaga 


tgtttgagga 


ggcggatgat 


2940 


tcgccgtttg 


gtgctcagac 


cctccagctg 


gcgtcggatt 


ggattggaaa 


gcaatactgt 


3000 


gaggacagtg 


atgattctga 


cgatttctac 


gactcaagtg 


aaggtattgc 


ggatgacggc 


3060 


aaagatgaca 


ccgagggtga 


ggaattcaag 


aaggcccgcg 


aattgaggcg 


ccaggaagtt 


3120 


cgcctgcagc 


cgttgcccgc 


caatctgccc 


acagatacgg 


agaccgaggt 


tcaaaccgag 


3180 


tccgagagca 


cttcacccga 


cgaagtggag 


ctcaattctg 


ccactgagat 


atccaccgac 


3240 


tctgagtttg 


acaacgatga 


gattatacgc 


caggcgccca 


aaatcttcat 


cgatgacacc 


3300 


catctaagga 


agcccaccaa 


ggttcagatc 


aagtccacca 


tgatcggacc 


caatgcagct 


3360 


tccgccggac 


tccatcagaa 


gcagttggcg 


gcgcgtgaga 


agggcggcag 


ctacctccag 


3420 


aagtaccaac 


cacaaccgcc 


actgtcacag 


tttaaaccgt 


tggtccaggt 


ggatcccacc 


3480 


ctgctcattg 


gcagccagcg 


cgctcctctt 


cagaatccac 


ggccaggaga 


ctacttgcta 


3540 


aacaagacgg 


ccagtacgga 


gggtatcgcc 


tcaaaaaaga 


gcctggagct 


aaaaaagcgc 


3600 


tatctgctgg 


gtgagccggc 


caatggcgat 


aagatccaga 


agtccggatc 


cacttcagtg 


3660 


ctggattcac 


gcattcgcag 


cttccagtcg 


aacatatcgg 


agtgccagaa 


gcttttgaat 


3720 


cccagcagcg 


acataagtgc 


cggcatgcga 


accttcctcg 


atcgcacaaa 


gttgggcgaa 


3780 


ggcagccaga 


cgacacccgg 


acagacgaac 


gaactaatcc 


gttccgccac 


cagcaatgtg 


3840 


attaacgatc 


tgcgcgtgga 


gcttcggata 


cagaaaactg 


actccagcca 


ctccacggac 


3900 


aacgagaagg 


aaaacgtttt 


cgtgaactgt 


aagaacgagc 


tgaacaaggg 


gatggaatac 


3960 


acggatgcgg 


tcaatgccac 


gctgctggac 


cagctggcca 


gaaaaagttc 


acccaccacg 


4020 


ccgacgaata 


agacggtggt 


cgaggttatt 


gacctggtta 


cacctgagaa 


gccaattgac 


4080 


attatcgatc 


taacggcact 


ggaaacgccg 


aaaaagcagt 


tggtcgatgg 


tagcgccatg 


4140 


gatgtagatg 


aacgcctcac 


acccgatagc 


aacaaaatca 


gcgaactgca 


gcaggaagtg 


4200 


aaggaggaac 


ccaagccgga 


tgtctctagg 


gatgtgaaag 


aatgcatacc 


agatatactg 


4260 


ggacacatta 


aggagggaac 


gggatcgaag 


gagccaggtg 


gagaggacca 


acagagcctg 


4320 


ctggagcagt 


cggacgaaga 


gaagcgcgac 


tcaccggaaa 


aggatgtggc 


cgaacatgag 


4380 


ctttatgaac 


cggacagtgt 


gcagatccag 


gtgcccaata 


tcccatggga 


aaaaagcaag 


4440 


ccggaggtca 


tgtctaccac 


cggcagcagt 


ggctccatct 


gctcaagctc 


agactcttct 


4500 


agtattgaag 


acatccagca 


ctacattttg 


gagtccacaa 


ctagtccaga 


tactcagaca 


4560 
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gttggcggaa agcacaatgt' gccccgtttg gaggtgcacg acacaagtgg tgccctgatg 462 0 
caggtggaca gcctgatgat tgtgaacgga aagtatattg gggatcccga ggatgtcaag 468 0 
ttcttggata tgccggccaa tgttattgtt ccgccagcac cggcgcttaa aacgaatgag 474 0 
ctggatatgg aggatgacca agaggcggag gcggaaccag taactgctac tccggagccg 48 00 
gtggaatgta cggtcatcga ggctgagcgc cgtgttactg ctccccctcc tttgccagag 4860 
atgggtccac ccaaactgaa gttcgatagc aaaaatgaga acaagatcga gagcttgaag 4 920 
aatcttccgt tgatcgtaga gagcaatgtg gagcacagtc aggcagtgaa acccattact 4 980 
cttaacttaa gcaatctggc caggacgccg gatacaccaa ccacgcccac ggcgcacgat 5 040 
agcgataaaa cacccactgg ggaaattctc tcgcgaggat ctgactcaga aaccgagcac 510 0 
actggcactg gtcaggtact aacggagacg gaactctccg actggacggc cgacgactgt 5160 
atctcggaga actttgttga cttggagttc gcgcttaact ctaacaaggg tacgataaaa 522 0 
cggcgcaagg atcgacgacg cagtggagca agcaaacttc ccagtggcaa cgaggtaatc 528 0 
cacgagctgg ccaggcaggc gccagtggtg caaatggatg gaattcttag tgccatcgac 534 0 
attgatgaca tagagttcat ggacacgggt tcggagggtt cttgtgctga agcttatccc 5400 
gcaacaaata cagctctcat tcagaataga ggttacatgg agtacatcga ggcggagccg 5460 

aaaaagacga cccgcaaggc agctccacca tccagttacc caggaaattt accgccttta 552 0 

atgacgaagc gggacgagaa actgggcgtt gattacattg agcagggggc gtacataatg 5580 

cacgatgatg caaagacgcc tgtgaatgag gtggctcctg ccatgaccca gtcgctaact 5 640 

gactcaatca cgctcaatga actggatgat gacagtatga taatatccca aacccagcca 5700 

acgacaacgg aggaaagtga ggcactgacg gtggtcacca gtccacttga cacgtcctcg 57 60 

cccagggttc tcgatcaatt tgcatccatg ttggcggcgg gaaaaggtga ctccacaccc 5820 

agtagctcag agcaacaacc aaagacgtct acggtgacga gcagcagcac tgggcccaac 5880 

tcctcgacaa caggaaacgt ctcgaaggag ccgcaggagg aggacctgca aatccagttt 5 94 0 

gagtatgttc gagcactgca gcagcggata tcgcagatca gcacccaacg gcgtaagagc 6000 

tctaagggag aggcacctaa cctgcagcta aacagtagcg cacctgtgat agaatcagcc 6060 

gaggatccgg ccaagcccgc agaggagcct ctggtctcaa tgcgaccgcg gaccaccagc 612 0 

atttccggaa aggtaccgga gatacccaca cttagcagca agctggaaga gataaccaaa 618 0 

gaacgcacta agcaaaagga tctgattcac gacctagtca tggacaagtt gcagtcgaag 624 0 

aagcagctaa acgctgagaa gcgtctgcac cggagtcgac agcgcagttt gctgaccagt 63 00 

ggctatgcca gtggctccag ccttagtccg acgcccaagc tggctgctgc ttgcagtccg 6360 

caggattcca actgctctag ccaagcgcac taccacgcct ccacggcgga ggaggccccg 6420 
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aagccgccgg cggaaaggcc gttgcagaag tccgccacgt ccacctatgt gtcgccttat 6480 
cgcactgtcc aagcgcccac acgtagtgct gatctctata agccgcgccc cttcagcgaa 6540 
cacatcgatt cgaacgctct ggcgggttac aagctcggca agacggcctc gtttaatggc 6600 
ggcaagttgg gcgactttgc gaaacccatt gccccggcga gagttaaccg aggaggaggt 6660 

gtcgcgaccg cggatatagc caatatttcc gcgtcgacgg agaacctaag aagcgaggcc 672 0 

agggccaggg ctcgtcttaa gtctaacaca gagctgggcc ttagtcccga ggaaaagatg 678 0 

cagctaatac gttcaagatt gcactacgac caaaacagat ctctgaagcc gaagcaactg 6840 

gaggagatgc catccgggga tctggcggca cgtgcccgca aaatgagtgc ctcgaagagc 6900 

gtcaatgatc tggcctacat ggtgggacag cagcagcagc agcaggttga gaaggatgcc 6960 

gtgctccaag ccaaggcggc tgactttaca tccgatccca atttggcgtc cggtggtcag 7 02 0 

gagaaggcag gcaaaactaa gtccggacgc aggccaaagg atccggagcg gcgtaagagt 7 080 

ctcatacagt cgttgtccag cttcttccaa aagggatctg gatccgcggc ctccagttcc 7140 

aaggagcagg gcggcgctgt ggctgccgtc cactctgaac agtcagagcg accaggcacc 72 00 

agcagcagcg gcacgcccac aatatcggat gcggcgggtg gaggcggagg aggaggtggc 72 60 

gtcttcagca gattccgcat ctcgcccaag tccaaggaga agtcaaagtc ttgctttgat 7320 

ctgaggaatt tcggttttgg tgacaaggat atgctggtct gcaatgcagc atctccagca 73 80 

ggagccacat ccgcatcaca gaaaaatcac tcgcaagagt atctgaacac cacgaacaac 7440 

agtcgctatc gaaagcaaac gaacactgcg aaaccgaaac ccgaatcgtt ctcttcatcc 75 00 

agtccgcagc tctatataca caagccccac cacctggccg cagctcatcc cagtgccctg 7560 

gacgaccaga caccaccacc cataccgcct cttccactga attatcagag atccgatgat 7620 

gagagctacg ctaacgagac acgagagcat aagaagcaac gtgccatatc gaaggcttca 7680 

cgacaagctg agctcaagcg attgcgaatc gctcaagaga ttcagcggga acaggaggag 7740 

atcgaggtgc aactgaagga tctggaggca cgcggcgtgc ttattgagaa ggccttgcga 78 00 

ggcgaggcgc agaatattga aaacctggat gcgacaaagg acaacgacga gaagctactt 78 60 

aaggaacttt tggagatttg gcgcaacatc acagcactca agaaacgcga tgaggaactg 792 0 

actataaggc aacaggaact gcaactggag tatcggcatg cccagctgaa ggaagagctc 7980 

aatctgcgct tgtcctgcaa caaactggac aaaagctctg ccgatgtggc cgccgaggga 8 040 

gcaattctca acgagatgct ggaaattgtc gccaagcgag ccgccctacg acccacagcc 8100 

tcccagctcg acctcacggc agcgggatca gcatccacgt ccgccgaggc aacgggcatt 8160 

aagctgacgg gacaaccgca tgaccacgaa gaatcgatca tttga 82 05 
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<210> 12 

<211> 2734 

<212> PRT 

<213> Drosophila 

<400> 12 

Met Ser Arg Gin His Gin Arg His His Gin Gin His His His Leu Pro 
1 5 10 15 



Pro His Gin Gin Pro Gin Gin Gin Met Pro Gin Gin Gin Gin Gin Leu 
20 25 30 



Thr Ala Gin Gin Gin Gin Gin Gin Gin Leu Leu Met Ala Glu His Ala 
35 40 45 



Ala Ala Ala Glu Ala Ala Glu Leu Phe Asp Leu Leu Cys Val Ala Thr 
50 55 60 



Thr Met Arg Gin lie Leu Ala Leu His Arg Ala Met Cys Glu Ala Val 
65 70 75 80 



Gly Leu Arg Pro Ser Pro Leu Asn Asp Phe Tyr Pro Arg Leu Lys Ala 
85 90 95 



Lys Val Arg Ser Trp Lys Ala Gin Ala Leu Trp Lys Lys Phe Asp Ala 
100 105 110 



Arg Ala Ala His Arg Val Tyr Gly Lys Gly Ala Ala Cys Thr Gly Thr 
115 120 125 



Arg Val Leu Val lie Gly Ala Gly Pro Cys Gly Leu Arg Thr Ala lie 
130 . 135 140 



Glu Ala Gin Leu Leu Gly Ala Lys Val Val Val Leu Glu Lys Arg Asp 
145 150 155 160 



Arg He Thr Arg Asn Asn Val Leu His Leu Trp Pro Phe Val He Thr 
165 170 175 



Asp Leu Arg Asn Leu Gly Ala Lys Lys Phe Tyr Gly Lys Phe Cys Ala 
180 185 190 



Gly Ser He Asp His He Ser He Arg Gin Leu Gin Cys Met Leu Leu 
195 200 205 



Lys Val Ala Leu Leu Leu Gly Val Glu He His Glu Gly Val Ser Phe 
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210 215 220 

Asp His Ala Val Glu Pro Ser Gly Asp Gly Gly Gly Trp Arg Ala Ala 
225 230 235 ~ 240 

Val Thr Pro Ala Asp His Pro Val Ser His Tyr Glu Phe Asp Val Leu 
245 250 255 

He Gly Ala Asp Gly Lys Arg Asn Met Leu Asp Phe Arg Arg Lys Glu 
260 265 270 

Phe Arg Gly Lys Leu Ala He Ala He Thr Ala Asn Phe He Asn Lys 
275 280 285 

Lys Thr Glu Ala Glu Ala Lys Val Glu Glu He Ser Gly Val Ala Phe 
290 295 300 

He Phe Asn Gin Ala Phe Phe Lys Glu Leu Tyr Gly Lys Thr Gly He 
305 310 315 ~ ' 320 

Asp Leu Glu Asn He Val Tyr Tyr Lys Asp Glu Thr His Tyr Phe Val 
325 330 335 

Met Thr Ala Lys Lys His Ser Leu He Asp Lys Gly Val He He Glu 
340 345 ' 350 

Asp Met Ala Asp Pro Gly Glu Leu Leu Ala Pro Ala Asn Val Asp Thr 
355 360 3 65 

Gin Lys Leu His Asp Tyr Ala Arg Glu Ala Ala Glu Phe Ser Thr Gin 
370 375 380 

Tyr Gin Met Pro Asn Leu Glu Phe Ala Val Asn His Tyr Gly Lys Pro 
385 390 395 400 

Asp Val Ala Met Phe Asp Phe Thr Ser Met Phe Ala Ala Glu Met Ser 
405 410 415 

Cys Arg Val He Val Arg Lys Gly Ala Arg Leu Met Gin Cys Leu Val 
42 0 425 430 

Gly Asp Ser Leu Leu Glu Pro Phe Trp Pro Thr Gly Ser Gly Cys Ala 
435 440 ~ 445 



Arg Gly Phe Leu Ser Ser Met Asp Ala Ala Tyr Ala He Lys Leu Trp 
450 455 46 0 
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Ser Asn Pro Gin Asn Ser Thr Leu Gly Val Leu Ala Gin Arg Glu Ser 
465 470 475 480 



lie Tyr Arg Leu Leu Asn Gin Thr Thr Pro Asp Thr Leu Gin Arg Asp 
485 490 495 



lie Ser Ala Tyr Thr Val Asp Pro Ala Thr Arg Tyr Pro Asn Leu Asn 
500 505 510 



Arg Glu Ser Val Asn Ser Trp Gin Val Lys His Leu Val Asp Thr Asp 
515 520 525 



Asp Pro Ser lie Leu Glu Gin Thr Phe Met Asp Thr His Ala Leu Gin 
530 535 540 



Thr Pro His Leu Asp Thr Pro Gly Arg Arg Lys Arg Arg Ser Gly Asp 
545 550 555 560 



Leu Leu Pro Gin Gly Ala Thr Leu Leu Arg Trp lie Ser Ala Gin Leu 
565 570 575 



His Ser Tyr Gin Phe lie Pro Glu Leu Lys Glu Ala Ser Asp Val Phe 
580 585 590 



Arg Asn Gly Arg Val Leu Cys Ala Leu lie Asn Arg Tyr Arg Pro Asp 
595 600 605 



Leu lie Asp Tyr Ala Ala Thr Lys Asp Met Ser Pro Val Glu Cys Asn 
610 615 620 



Glu Leu Ser Phe Ala Val Leu Glu Arg Glu Leu His lie Asp Arg Val 
625 630 635 640 



Met Ser Ala Lys Gin Ser Leu Asp Leu Thr Glu Leu Glu Ser Arg lie 
645 650 655 



Trp Leu Asn Tyr Leu Asp Gin lie Cys Asp Leu Phe Arg Gly Glu lie 
660 665 670 



Pro His lie Lys His Pro Lys Met Asp Phe Ser Asp Leu Arg Gin Lys 
675 680 685 



Tyr Arg lie Asn His Thr His Ala Gin Pro Asp Phe Ser Lys Leu Leu 
690 695 700 
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Ala Thr Lys Pro Lys Ala Lys Ser Pro Met Gin Asp Ala Val Asp lie 
705 710 715 720 



Pro Thr Thr Val Gin Arg Arg Ser Val Leu Glu Gin Glu Arg Ala Lys 
725 730 735 



Arg Gin Arg Arg His Glu Gin Leu Leu Asn lie Gly Gly Gly Ala Ala 
740 745 750 



Gly Ala Ala Ala Gly Val Ala Gly Ser Gly Thr Gly Thr Thr Thr Gin 
755 760 765 



Gly Gin Asn Asp Thr Pro Arg Arg Ser Lys Lys Arg Arg Gin Val Asp 
770 775 780 



Lys Thr Ala Asn lie Ser Ser Lys Val Ala Leu Ala Phe Lys Lys Gin 
785 790 795 800 



Ala Ala Ser Glu Lys Cys Arg Phe Cys Lys Gin Thr Val Tyr Leu Met 
805 810 815 



Glu Lys Thr Thr Val Glu Gly Leu Val Leu His Arg Asn Cys Leu Lys 
820 825 83 0 



Cys His His Cys His Thr Asn Leu Arg Leu Gly Gly Tyr Ala Phe Asp 
835 840 845 



Arg Asp Asp Pro Gin Gly Arg Phe Tyr Cys Thr Gin His Phe Arg Leu 
850 855 860 



Pro Pro Lys Pro Leu Pro Gin Arg Thr Asn Lys Ala Arg Lys Ser Ala 
865 870 875 880 



Ala Ala Gin Pro Ala Ser Pro Ala Val Pro Pro Thr Ala Gly Ser Val 
885 890 895 



Pro Thr Ala Ala Ala Thr Ser Glu His Met Asp Thr Thr Pro Pro Arg 
900 905 910 



Asp Gin Val Asp Leu Leu Gin Thr Ser Arg Ala Asn Ala Ser Ala Asp 
915 920 925 



Ala Met Ser Asp Asp Glu Ala Asn Val lie Asp Glu His Glu Trp Ser 
930 935 940 
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Gly Arg Asn Phe Leu Pro Glu Ser Asn Asri Asp Ser Gin Ser Glu Leu 
945 950 955 960 



Ser Ser Ser Asp Glu Ser Asp Thr Glu Ser Asp Ser Glu Met Phe Glu 
965 970 975 



Glu Ala Asp Asp Ser Pro Phe Gly Ala Gin Thr Leu Gin Leu Ala Ser 
980 985 990 



Asp Trp lie Gly Lys Gin Tyr Cys Glu Asp Ser Asp Asp Ser Asp Asp 
995 1000 1005 



Phe Tyr Asp Ser Ser Glu Gly lie Ala Asp Asp Gly Lys Asp Asp 
1010 1015 1020 



Thr Glu Gly Glu Glu Phe Lys Lys Ala Arg Glu Leu Arg Arg Gin 
1025 1030 1035 



Glu Val Arg Leu Gin Pro Leu Pro Ala Asn Leu Pro Thr Asp Thr 
1040 1045 1050 



Glu Thr Glu Val Gin Thr Glu Ser Glu Ser Thr Ser Pro Asp Glu 
1055 1060 1065 



Val Glu Leu Asn Ser Ala Thr Glu lie Ser Thr Asp Ser Glu Phe 
1070 1075 1080 



Asp Asn Asp Glu lie lie Arg Gin Ala Pro Lys lie Phe lie Asp 
1085 1090 1095 



Asp Thr His Leu Arg Lys Pro Thr Lys Val Gin He Lys Ser Thr 
1100 1105 1110 



Met He Gly Pro Asn Ala Ala Ser Ala Gly Leu His Gin Lys Gin 
1115 1120 1125 



Leu Ala Ala Arg Glu Lys Gly Gly Ser Tyr Leu Gin Lys Tyr Gin 
1130 1135 1140 



Pro Gin Pro Pro Leu Ser Gin Phe Lys Pro Leu Val Gin Val Asp 
1145 1150 1155 



Pro Thr Leu Leu He Gly Ser Gin Arg Ala Pro Leu Gin Asn Pro 
1160 1165 1170 



Arg Pro Gly Asp Tyr Leu Leu Asn Lys Thr Ala Ser Thr Glu Gly 
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1175 1180 1185 



lie Ala Ser Lys Lys Ser Leu Glu Leu Lys Lys Arg Tyr Leu Leu 
1190 1195 1200 



Gly Glu Pro Ala Asn Gly Asp Lys lie Gin Lys Ser Gly Ser Thr 
1205 1210 1215 



Ser Val Leu Asp Ser Arg lie Arg Ser Phe Gin Ser Asn lie Ser 
1220 1225 1230 



Glu Cys Gin Lys Leu Leu Asn Pro Ser Ser Asp lie Ser Ala Gly 
1235 1240 1245 



Met Arg Thr Phe Leu Asp Arg Thr Lys Leu Gly Glu Gly Ser Gin 
1250 1255 1260 



Thr Thr Pro Gly Gin Thr Asn Glu Leu lie Arg Ser Ala Thr Ser 
1265 1270 1275 



Asn Val lie Asn Asp Leu Arg Val Glu Leu Arg lie Gin Lys Thr 
1280 1285 1290 



Asp Ser Ser His Ser Thr Asp Asn Glu Lys Glu Asn Val Phe Val 
1295 1300 1305 



Asn Cys Lys Asn Glu Leu Asn Lys Gly Met Glu Tyr Thr Asp Ala 
1310 1315 1320 



Val Asn Ala Thr Leu Leu Asp Gin Leu Ala Arg Lys Ser Ser Pro 
1325 1330 1335 



Thr Thr Pro Thr Asn Lys Thr Val Val Glu Val He Asp Leu Val 
1340 1345 1350 



Thr Pro Glu Lys Pro He Asp He He Asp Leu Thr Ala Leu Glu 
1355 1360 1365 



Thr Pro Lys Lys Gin Leu Val Asp Gly Ser Ala Met Asp Val Asp 
1370 1375 1380 



Glu Arg Leu Thr Pro Asp Ser Asn Lys He Ser Glu Leu Gin Gin 
1385 1390 1395 



Glu Val Lys Glu Glu Pro Lys Pro Asp Val Ser Arg Asp Val Lys 
1400 1405 1410 
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Glu Cys He Pro Asp He Leu Gly His He Lys Glu Gly Thr Gly 
1415 1420 1425 



Ser Lys Glu Pro Gly Gly Glu Asp Gin Gin Ser Leu Leu Glu Gin 
1430 1435 1440 



Ser Asp Glu Glu Lys Arg Asp Ser Pro Glu Lys Asp Val Ala Glu 
1445 1450 1455 



His Glu Leu Tyr Glu Pro Asp Ser Val Gin He Gin Val Pro Asn 
1460 1465 1470 



He Pro Trp Glu Lys Ser Lys Pro Glu Val Met Ser Thr Thr Gly 
1475 1480 1485 



Ser Ser Gly Ser He Cys Ser Ser Ser Asp Ser Ser Ser He Glu 
1490 1495 1500 



Asp He Gin His Tyr He Leu Glu Ser Thr Thr Ser Pro Asp Thr 
1505 1510 1515 



Gin Thr Val Gly Gly Lys His Asn Val Pro Arg Leu Glu Val His 
1520 1525 1530 



Asp Thr Ser Gly Ala Leu Met Gin Val Asp Ser Leu Met He Val 
1535 1540 1545 



Asn Gly Lys Tyr He Gly Asp Pro Glu Asp Val Lys Phe Leu Asp 
1550 1555 1560 



Met Pro Ala Asn Val He Val Pro Pro Ala Pro Ala Leu Lys Thr 
1565 1570 1575 



Asn Glu Leu Asp Met Glu Asp Asp Gin Glu Ala Glu Ala Glu Pro 
1580 1585 1590 



Val Thr Ala Thr Pro Glu Pro Val Glu Cys Thr Val He Glu Ala 
1595 1600 1605 



Glu Arg Arg Val Thr Ala Pro Pro Pro Leu Pro Glu Met Gly Pro 
1610 1615 1620 



Pro Lys Leu Lys Phe Asp Ser Lys Asn Glu Asn Lys He Glu Ser 
1625 1630 1635 
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Leu Lys Asn Leu Pro Leu lie Val Glu Ser Asn Val Glu His Ser 
1640 1645 1650 



Gin Ala Val Lys Pro lie Thr Leu Asn Leu Ser Asn Leu Ala Arg 
1655 1660 1665 



Thr Pro Asp Thr Pro Thr Thr Pro Thr Ala His Asp Ser Asp Lys 
1670 1675 1680 



Thr Pro Thr Gly Glu He Leu Ser Arg Gly Ser Asp Ser Glu Thr 
1685 1690 1695 



Glu His Thr Gly Thr Gly Gin Val Leu Thr Glu Thr Glu Leu Ser 
1700 1705 1710 



Asp Trp Thr Ala Asp Asp Cys He Ser Glu Asn Phe Val Asp Leu 
1715 1720 1725 



Glu Phe Ala Leu Asn Ser Asn Lys Gly Thr He Lys Arg Arg Lys 
1730 1735 1740 



Asp Arg Arg Arg Ser Gly Ala Ser Lys Leu Pro Ser Gly Asn Glu 
1745 1750 1755 



Val He His Glu Leu Ala Arg Gin Ala Pro Val Val Gin Met Asp 
1760 1765 1770 



Gly He Leu Ser Ala He Asp He Asp Asp He Glu Phe Met Asp 
1775 1780 1785 



Thr Gly Ser Glu Gly Ser Cys Ala Glu Ala Tyr Pro Ala Thr Asn 
1790 1795 1800 



Thr Ala Leu He Gin Asn Arg Gly Tyr Met Glu Tyr He Glu Ala 
1805 1810 1815 



Glu Pro Lys Lys Thr Thr Arg Lys Ala Ala Pro Pro Ser Ser Tyr 
1820 1825 1830 



Pro Gly Asn Leu Pro Pro Leu Met Thr Lys Arg Asp Glu Lys Leu 
1835 1840 1845 



Gly Val Asp Tyr He Glu Gin Gly Ala Tyr He Met His Asp Asp 
1850 1855 1860 
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Ala Lys Thr Pro Val Asn Glu Val Ala Pro Ala Met Thr Gin Ser 
1865 1870 1875 



Leu Thr Asp Ser lie Thr Leu Asn Glu Leu Asp Asp Asp Ser Met 
1880 1885 1890 



lie lie Ser Gin Thr Gin Pro Thr Thr Thr Glu Glu Ser Glu Ala 
1895 1900 1905 



Leu Thr Val Val Thr Ser Pro Leu Asp Thr Ser Ser Pro Arg Val 
1910 1915 1920 



Leu Asp Gin Phe Ala Ser Met Leu Ala Ala Gly Lys Gly Asp Ser 
1925 1930 1935 



Thr Pro Ser Ser Ser Glu Gin Gin Pro Lys Thr Ser Thr Val Thr 
1940 1945 1950 



Ser Ser Ser Thr Gly Pro Asn Ser Ser Thr Thr Gly Asn Val Ser 
1955 1960 1965 



Lys Glu Pro Gin Glu Glu Asp Leu Gin He Gin Phe Glu Tyr Val 
1970 1975 1980 



Arg Ala Leu Gin Gin Arg He Ser Gin He Ser Thr Gin Arg Arg 
1985 1990 1995 



Lys Ser Ser Lys Gly Glu Ala Pro Asn Leu Gin Leu Asn Ser Ser 
2000 2005 2010 



Ala Pro Val He Glu Ser Ala Glu Asp Pro Ala Lys Pro Ala Glu 
2015 2020 2025 



Glu Pro Leu Val Ser Met Arg Pro Arg Thr Thr Ser He Ser Gly 
2030 2035 2040 



Lys Val Pro Glu He Pro Thr Leu Ser Ser Lys Leu Glu Glu He 
2045 2050 2055 



Thr Lys Glu Arg Thr Lys Gin Lys Asp Leu He His Asp Leu Val 
2060 2065 2070 



Met Asp Lys Leu Gin Ser Lys Lys Gin Leu Asn Ala Glu Lys Arg 
2075 2080 2085 



Leu His Arg Ser Arg Gin Arg Ser Leu Leu Thr Ser Gly Tyr Ala 
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2090 2095 2100 



Ser Gly Ser Ser Leu Ser Pro Thr Pro Lys Leu Ala Ala Ala Cys 
2105 2110 2115 



Ser Pro Gin Asp Ser Asn Cys Ser Ser Gin Ala His Tyr His Ala 
2120 2125 2130 



Ser Thr Ala Glu Glu Ala Pro Lys Pro Pro Ala Glu Arg Pro Leu 
2135 2140 2145 



Gin Lys Ser Ala Thr Ser Thr Tyr Val Ser Pro Tyr Arg Thr Val 
2150 2155 2160 



Gin Ala Pro Thr Arg Ser Ala Asp Leu Tyr Lys Pro Arg Pro Phe 
2165 2170 2175 



Ser Glu His lie Asp Ser Asn Ala Leu Ala Gly Tyr Lys Leu Gly 
2180 2185 2190 



Lys Thr Ala Ser Phe Asn Gly Gly Lys Leu Gly Asp Phe Ala Lys 
2195 2200 2205 



Pro lie Ala Pro Ala Arg Val Asn Arg Gly Gly Gly Val Ala Thr 
2210 2215 ~ "* 2220 



Ala Asp lie Ala Asn lie Ser Ala Ser Thr Glu Asn Leu Arg Ser 
2225 2230 2235 



Glu Ala Arg Ala Arg Ala Arg Leu Lys Ser Asn Thr Glu Leu Gly 
2240 2245 2250 



Leu Ser Pro Glu Glu Lys Met Gin Leu lie Arg Ser Arg Leu His 
2255 2260 2265 



Tyr Asp Gin Asn Arg Ser Leu Lys Pro Lys Gin Leu Glu Glu Met 
2270 2275 2280 



Pro Ser Gly Asp Leu Ala Ala Arg Ala Arg Lys Met Ser Ala Ser 
2285 2290 2295 



Lys Ser Val Asn Asp Leu Ala Tyr Met Val Gly Gin Gin Gin Gin 
2300 2305 2310 



Gin Gin Val Glu Lys Asp Ala Val Leu Gin Ala Lys Ala Ala Asp 
2315 2320 2325 
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Phe Thr Ser Asp Pro Asn Leu Ala Ser Gly Gly Gin Glu Lys Ala 
2330 2335 2340 



Gly Lys Thr Lys Ser Gly Arg Arg Pro Lys Asp Pro Glu Arg Arg 
2345 2350 2355 



Lys Ser Leu lie Gin Ser Leu Ser Ser Phe Phe Gin Lys Gly Ser 
2360 2365 2370 



Gly Ser Ala Ala Ser Ser Ser Lys Glu Gin Gly Gly Ala Val Ala 
2375 2380 2385 



Ala Val His Ser Glu Gin Ser Glu Arg Pro Gly Thr Ser Ser Ser 
2390 2395 2400 



Gly Thr Pro Thr lie Ser Asp Ala Ala Gly Gly Gly Gly Gly Gly 
2405 2410 2415 



Gly Gly Val Phe Ser Arg Phe Arg lie Ser Pro Lys Ser Lys Glu 
2420 2425 2430 



Lys Ser Lys Ser Cys Phe Asp Leu Arg Asn Phe Gly Phe Gly Asp 
2435 2440 2445 



Lys Asp Met Leu Val Cys Asn Ala Ala Ser Pro Ala Gly Ala Thr 
2450 2455 2460 



Ser Ala Ser Gin Lys Asn His Ser Gin Glu Tyr Leu Asn Thr Thr 
2465 2470 2475 



Asn Asn Ser Arg Tyr Arg Lys Gin Thr Asn Thr Ala Lys Pro Lys 
2480 2485 2490 



Pro Glu Ser Phe Ser Ser Ser Ser Pro Gin Leu Tyr He His Lys 
2495 2500 2505 



Pro His His Leu Ala Ala Ala His Pro Ser Ala Leu Asp Asp Gin 
2510 2515 2520 



Thr Pro Pro Pro He Pro Pro Leu Pro Leu Asn Tyr Gin Arg Ser 
2525 2530 2535 



Asp Asp Glu Ser Tyr Ala Asn Glu Thr Arg Glu His Lys Lys Gin 
2540 2545 2550 
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Arg Ala lie Ser Lys Ala Ser Arg Gin Ala Glu Leu Lys Arg Leu 
2555 2560 2565 



Arg He Ala Gin Glu He Gin Arg Glu Gin Glu Glu He Glu Val 
2570 2575 2580 



Gin Leu Lys Asp Leu Glu Ala Arg Gly Val Leu He Glu Lys Ala 
2585 2590 2595 



Leu Arg Gly Glu Ala Gin Asn He Glu Asn Leu Asp Ala Thr Lys 
2600 2605 2610 



Asp Asn Asp Glu Lys Leu Leu Lys Glu Leu Leu Glu He Trp Arg 
2615 2620 2625 



Asn He Thr Ala Leu Lys Lys Arg Asp Glu Glu Leu Thr He Arg 
2630 2635 2640 



Gin Gin Glu Leu Gin Leu Glu Tyr Arg His Ala Gin Leu Lys Glu 
2645 2650 2655 



Glu Leu Asn Leu Arg Leu Ser Cys Asn Lys Leu Asp Lys Ser Ser 
2660 2665 2670 



Ala Asp Val Ala Ala Glu Gly Ala He Leu Asn Glu Met Leu Glu 
2675 2680 2685 



He Val Ala Lys Arg Ala Ala Leu Arg Pro Thr Ala Ser Gin Leu 
2690 2695 2700 



Asp Leu Thr Ala Ala Gly Ser Ala Ser Thr Ser Ala Glu Ala Thr 
2705 2710 2715 



Gly He Lys Leu Thr Gly Gin Pro His Asp His Glu Glu Ser He 
2720 2725 2730 



He 



<210> 13 

<211> 5411 

<212> DNA 

<213> Homo sapiens 



<400> 13 

ggccaagccg gggccccgaa gccagagccg gagccgggcg ggccgcgggg tcatggctgg 



60 
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gccgcggggc 


gcgctgctgg 


cctggtgccg 


ccgccagtgc 


gagggctacc 


gcggcgtgga 


120 


gatccgcgac 


ctgagcagct 


ccttccggga 


cggcctggcc 


ttctgcgcca 


tcctgcaccg 


180 


gcaccggccc 


gacctgctag 


attttgattc 


gctttccaag 


gacaatgtct 


tcgagaataa 


240 


ccgtttggcc 


tttgaagtgg 


ctgagaagga 


gctggggatc 


cccgctctcc 


tggaccccaa 


300 


tgacatggtc 


tccatgagcg 


tccctgactg 


cctcagcatc 


atgacctatg 


tgtcccagta 


360 


ttacaaccac 


ttctgcagtc 


ctggccaagc 


tggtgtctcg 


ccacccagaa 


agggccttgc 


420 


accctgttcc 


ccgccgtctg 


tagcacccac 


tccagtggaa 


ccagaagatg 


tggctcaggg 


480 


cgaggagctc 


tcctcaggca 


gcctgtcaga 


gcagggcacc 


ggccagaccc 


ccagcagcac 


540 


gtgcgcagcc 


tgccagcagc 


atgtgcactt 


ggtgcagcgc 


tacctggctg 


acggcaggct 


600 


gtaccatcgc 


cactgcttcc 


ggtgtcggcg 


gtgctccagc 


accctgctcc 


ctggggctta 


660 


tgagaatggg 


cctgaggagg 


gcacctttgt 


gtgtgcagaa 


cactgtgcca 


ggctgggccc 


720 


ggggacacgg 


tcggggacca 


ggcctgggcc 


cttctcacag 


ccaaagcagc 


agcaccagca 


780 


gcaactcgca 


gaagatgcca 


aggatgttcc 


aggaggcggc 


cccagctcca 


gtgctcctgc 


840 


aggggctgag 


gccgatggac 


ccaaggccag 


ccctgaggcc 


cggccgcaga 


tccctaccaa 


900 


gccccgggtt 


cctggcaaac 


tacaggagct 


ggccagcccc 


cctgcgggcc 


gccccacccc 


960 


tgcccccagg 


aaggcctctg 


agagcaccac 


cccagcaccc 


cccacgcccc 


ggccccgctc 


1020 


cagtctgcag 


caggagaacc 


tggtggagca 


ggctggcagc 


agcagcctgg 


tgaacgggag 


1080 


actgcacgaa 


ctgcctgtcc 


ccaagccgag 


ggggacaccg 


aagccgtccg 


aggggacacc 


1140 


agcccccagg 


aaggaccccc 


catggatcac 


gctggtgcag 


gcagaaccaa 


agaagaagcc 


1200 


agccccactt 


cccccaagca 


gcagcccggg 


gccaccaagc 


caggacagca 


ggcaggtgga 


1260 


gaatggaggc 


accgaggagg 


tggcccagcc 


gagcccaacg 


gccagcctgg 


agtccaaacc 


1320 


ctataacccc 


tttgaggagg 


aggaggagga 


caaggaggaa 


gaggctccag 


ctgcacccag 


1380 


cctggccacc 


agccctgccc 


tgggccaccc 


ggagtccaca 


cccaagtccc 


tgcacccctg 


1440 


gtacggcatc 


acccctacca 


gcagccccaa 


gacaaagaag 


cgccctgccc 


cgcgcgcacc 


1500 


cagcgcgtcc 


ccactggctc 


tccacgcctc 


ccgcctctcg 


cactcggagc 


cgccctcggc 


1560 


cacaccatcg 


ccagcgctca 


gcgtggagag 


cctgtcgtct 


gagagcgcca 


gccagactgc 


1620 


aggtgcagag 


cttctggagc 


cgccagctgt 


gcccaagagc 


tcctcagagc 


ctgctgtcca 


1680 


tgcccctggt 


acccctggaa 


accctgtcag 


cctctctacc 


aactcctccc 


tggcctcctc 


1740 


tggggaacta 


gtggagccta 


gagtggaaca 


aatgcctcaa 


gccagccctg 


gccttgcccc 


1800 


caggaccagg 


ggcagctcag 


gtccccagcc 


agccaagccc 


tgcagtggcg 


ccaccccaac 


1860 


gcctctcttg 


ttggttggag 


acaggagccc 


ggtgccttcc 


cctggaagct 


cgtccccaca 


1920 
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gctgcaggta 


aagtcctcct 


gcaaggagaa 


tccttttaac 


cggaagccat 


cacctgcagc 


1980 


gtccccagcc 


acaaagaagg 


ccaccaaggg 


atccaagcca 


gtgaggccac 


ctgcccctgg 


2040 


acacggcttt 


ccactcatca 


aacgcaaggt 


ccaggctgac 


cagtacatcc 


ctgaggagga 


2100 


catccatgga 


gagatggata 


ccattgagcg 


ccggctggat 


gccctggagc 


accgtggggt 


2160 


gctgctggag gagaagctgc 


gtggcggcct 


gaatgagggc 


cgtgaggatg 


acatgctggt 


2220 


ggactggttc 


aagctcatcc 


acgagaagca 


cctactggtg 


cggcgagagt 


ccgagctcat 


2280 


ctatgtcttc 


aagcagcaga 


acctggagca 


gcgccaggct 


gatgtcgagt 


atgagctccg 


2340 


gtgcctcctc 


aataagccag 


aaaaggactg 


gacggaggag 


gaccgggccc 


gggagaaggt 


2400 


gctgatgcag 


gagcttgtga 


ccctcattga 


gcagcgcaac 


gctatcatca 


actgcctgga 


2460 


tgaggaccgg 


cagagggagg 


aagaggaaga 


caagatgttg 


gaagccatga 


tcaagaagaa 


2520 


agagttccag agggaggctg aacctgaggg 


caagaagaag 


gggaagttca 


agaccatgaa 


2580 


gatgttgaaa ctgctaggaa acaaacgtga 


tgccaagagc 


aagtccccca 


gagacaagag 


2640 


ctaacagcac 


gagaagccag 


ttggggactg 


ccccctcctg 


gagcagctcc 


tgggctgtgc 


2700 


tctgtttgaa gggggcgccc 


tgctcccctc 


agatcagtca 


ggaggaagat 


gactaagggg 


2760 


agggatcctc 


tgggtgatgg 


cctcttcctc 


ctcagggacc 


tctgactgct 


ctgggccaaa 


2820 


gaatctcttg 


tttcttctcc 


gagccccagg 


cagcggtgat 


tcagccctgc 


ccaacctgat 


2880 


tctgatgact 


gcggatgctg 


tgacggaccc 


aaggggcaaa 


tagggtccca 


gggtccaggg 


2940 


aggggcgcct 


gctgagcact 


tccgcccctc 


accctgccca 


gcccctgcca tgagctctgg 


3000 


gctgggtctc 


cgcctccagg gttctgctct 


tccaggcagg 


ccagcaagtg 


gcgctgggcc 


3060 


acactggctt 


cttcctgccc 


catccctggc 


tctgagtctc 


tgtcttcctg 


tcctgtgcag 


3120 


gcgcccttgg 


atctcagttt 


ccctcactca 


ggaactctgt 


ttctgaagtc 


ttcagttaag 


3180 


tttgagttta tgactgagtg gcctgtactg tcagacgtga atgggcctga cgggcaaatc 


3240 


catccctctc 


tccctcacag 


ttccaggagc 


ggcttccctc 


gtctcccctt 


actccacagg 


3300 


gagcctccct 


tgccaggacc 


agggctgcga 


cggccatgct 


ggggcaggtg 


agtgctctgt 


3360 


tagctgctcc 


cagtgctgtc 


cccaggctgc 


agttctggtc 


cctggttgtc 


aggtaggaag 


3420 


ggtgcacttg 


aagcaggtgc 


tcatctcggt 


tccttaacgt 


ttatagtctg 


acccctcact 


3480 


taggctttcc 


tctgccaccc 


cggtccaggg 


aagaggctcg 


ctcccgccca 


tggtcatcac 


3540 


tggtctgtct 


gctctgttgt 


ctgttctttc 


cctgactccc 


tcccaccgaa ggcctgatgg 


3600 


ctactcaccc 


ctctgggatg 


gctatgggag 


aggaggagtg 


atggggaccg 


ccaccttttc 


3660 


tgcaggaaat 


gtgcccagca 


gctcttggtc 


aaagcactgt 


tgctataagc 


tatctctggg 


3720 
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atgcctctag 


gcccccttcc 


ctctacacac 


ctctgggaaa 


agattacact 


gtattaactc 


3780 


tcgaggagtt 


tcctcaccaa 


taaacagaca 


acctcaactg 


ccagtgccct 


gcagcctcgg 


3840 


gccacagcgg 


cagccttgtt 


tgccttccca 


cctgcctctg 


ccacacctgg 


tggctgaaca 


3900 


tctctggtcg 


cccagaggcc 


atgttggggc 


catcctccaa 


gagggatctc 


tgccctcacc 


3960 


gcctgccact 


gggcaggatc 


cctttcctct 


gcagggagag 


gtggctcctc 


ggccatgcag 


4020 


cccctggcag 


gctccttcta 


aacatgcctg 


ttgacctgga 


gctggcgcca 


ccaactccag 


4080 


ggcctttcca 


gggccagaca 


ggtaacacgc 


atgaacccga 


gtgacagctc 


tgacgggctg 


4140 


tttcggtgtc 


aggagacaaa 


gctggcaggg 


gcaggggtga 


actggaggca 


agtcaagtca 


4200 


cctgtggcct 


gtggggctga 


atgtgggccc 


ggtgttgcca 


gatcctttgt 


cataagaagc 


4260 


tagaaatcca 


gattttatgt 


gtgtgtaatt 


tgtaaatgct 


gaaagctagc 


ctgaattttt 


4320 


tttttttttt 


tttgagacag 


agtctcgctc 


tgtcgcccag 


gctggagtgc 


agtggcgcga 


4380 


tctcagctca 


ctgcaagctc 


cgcctcctgg 


gttcacgcca 


tcctcctgcc 


tcggcctcct 


4440 


gagcagctgg 


gactacaggc 


gcatgctacg 


acgcctggct 


aattttttgt 


atttttagta 


4500 


gagacggggt 


ttcaccgtgt 


taaccaggat 


ggtctcgatc 


tcctgacctt 


gtgatccacc 


4560 


caccttggcc 


tcccaaagtg 


ctgggattac 


aggcgtgagc 


caccacgccc 


ggccactagc 


4620 


ctgaatttca 


atcaagggtt 


ggctgatact 


gtgtgtccag 


ggtggactgg 


atttgtcctg 


4680 


gggggttctc 


tggtttgctg 


cctcctgacc 


acatgatggg 


gccttcgagg 


tcgaggacaa 


4740 


ctgttcccat 


tagattgcac 


cctctgccct 


caggttcttg 


agggtgtgtg 


gacacagagg 


4800 


ctttccatgg 


gatgtccctg 


agccggccct 


tgattggggc 


ctcaccattt 


acagggccgt 


4860 


tttattctgc 


aaaccgaaac 


ttgggtcatg 


tgacctgatg 


ggattatggg 


actccctcca 


4920 


ggtgcccgag 


acaaggttga 


tatttccaaa 


atattttggt 


gatttagtgg 


gacaagcaaa 


4980 


tgacagaata 


ccggagaagg 


cagggatcgt 


gggtgtcagg 


agccagaggg 


gagggggaca 


5040 


gatgtgctgt 


gtacaggaca 


aggtgtcagg 


tgactccttc 


ccagcagggc 


ctcgcagatg 


5100 


cacaagcacg 


gagctggtgg 


gttttgccca 


agaaaggtca 


cgcggcacat 


gcagggattg 


5160 


gaactcccag 


gccagggctc 


taggtcgctc 


ccaccttttc 


atgtttcttt 


ctgtggccat 


5220 


gggtatagtg 


gaaagacata 


aagctaaagc 


caacttttaa 


tcctgaatgc 


actgcttgcc 


5280 


aggtaaatgc 


ccttggttgt 


ggtatcttgt 


tgagacttag 


ttttcacaga 


gggataatga 


5340 


accgttgcag 


aggtttattg 


agatcattaa 


cagagtggaa 


ttcagcaccc 


gccacagcag 


5400 


ccagccatgg 


9 










5411 



<210> 
<211> 



14 

863 
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<212> PRT 

<213> Homo sapiens 

<400> 14 

Met Ala Gly Pro Arg Gly Ala Leu Leu Ala Trp Cys Arg Arg Gin Cys 
15 10 15 



Glu Gly Tyr Arg Gly Val Glu lie Arg Asp Leu Ser Ser Ser Phe Arg 
20 25 30 



Asp Gly Leu Ala Phe Cys Ala lie Leu His Arg His Arg Pro Asp Leu 
35 40 45 



Leu Asp Phe Asp Ser Leu Ser Lys Asp Asn Val Phe Glu Asn Asn Arg 
50 55 60 



Leu Ala Phe Glu Val Ala Glu Lys Glu Leu Gly lie Pro Ala Leu Leu 
65 70 75 80 



Asp Pro Asn Asp Met Val Ser Met Ser Val Pro Asp Cys Leu Ser lie 
85 90 95 



Met Thr Tyr Val Ser Gin Tyr Tyr Asn His Phe Cys Ser Pro Gly Gin 
100 105 110 



Ala Gly Val Ser Pro Pro Arg Lys Gly Leu Ala Pro Cys Ser Pro Pro 
115 120 125 



Ser Val Ala Pro Thr Pro Val Glu Pro Glu Asp Val Ala Gin Gly Glu 
130 135 140 



Glu Leu Ser Ser Gly Ser Leu Ser Glu Gin Gly Thr Gly Gin Thr Pro 
145 150 155 160 



Ser Ser Thr Cys Ala Ala Cys Gin Gin His Val His Leu Val Gin Arg 
165 170 175 



Tyr Leu Ala Asp Gly Arg Leu Tyr His Arg His Cys Phe Arg Cys Arg 
180 185 190 



Arg Cys Ser Ser Thr Leu Leu Pro Gly Ala Tyr Glu Asn Gly Pro Glu 
195 200 205 



Glu Gly Thr Phe Val Cys Ala Glu His Cys Ala Arg Leu Gly Pro Gly 
210 215 220 
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Thr Arg Ser Gly Thr Arg Pro Gly Pro Phe Ser Gin Pro Lys Gin Gin 
225 230 235 240 



His Gin Gin Gin Leu Ala Glu Asp Ala Lys Asp Val Pro Gly Gly Gly 
245 250 255 



Pro Ser Ser Ser Ala Pro Ala Gly Ala Glu Ala Asp Gly Pro Lys Ala 
260 265 270 



Ser Pro Glu Ala Arg Pro Gin He Pro Thr Lys Pro Arg Val Pro Gly 
275 280 285 



Lys Leu Gin Glu Leu Ala Ser Pro Pro Ala Gly Arg Pro Thr Pro Ala 
290 295 300 



Pro Arg Lys Ala Ser Glu Ser Thr Thr Pro Ala Pro Pro Thr Pro Arg 
305 310 315 320 



Pro Arg Ser Ser Leu Gin Gin Glu Asn Leu Val Glu Gin Ala Gly Ser 
325 330 335 



Ser Ser Leu Val Asn Gly Arg Leu His Glu Leu Pro Val Pro Lys Pro 
340 345 350 



Arg Gly Thr Pro Lys Pro Ser Glu Gly Thr Pro Ala Pro Arg Lys Asp 
355 360 365 



Pro Pro Trp He Thr Leu Val Gin Ala Glu Pro Lys Lys Lys Pro Ala 
370 375 380 



Pro Leu Pro Pro Ser Ser Ser Pro Gly Pro Pro Ser Gin Asp Ser Arg 
385 390 395 400 



Gin Val Glu Asn Gly Gly Thr Glu Glu Val Ala Gin Pro Ser Pro Thr 
405 410 415 



Ala Ser Leu Glu Ser Lys Pro Tyr Asn Pro Phe Glu Glu Glu Glu Glu 
420 425 430 



Asp Lys Glu Glu Glu Ala Pro Ala Ala Pro Ser Leu Ala Thr Ser Pro 
435 440 445 



Ala Leu Gly His Pro Glu Ser Thr Pro Lys Ser Leu His Pro Trp Tyr 
450 455 460 



Gly He Thr Pro Thr Ser Ser Pro Lys Thr Lys Lys Arg Pro Ala Pro 
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465 470 475 480 



Arg Ala Pro Ser Ala Ser Pro Leu Ala Leu His Ala Ser Arg Leu Ser 
485 490 495 



His Ser Glu Pro Pro Ser Ala Thr Pro Ser Pro Ala Leu Ser Val Glu 
500 505 510 



Ser Leu Ser Ser Glu Ser Ala Ser Gin Thr Ala Gly Ala Glu Leu Leu 
515 520 525 



Glu Pro Pro Ala Val Pro Lys Ser Ser Ser Glu Pro Ala Val His Ala 
530 535 540 



Pro Gly Thr Pro Gly Asn Pro Val Ser Leu Ser Thr Asn Ser Ser Leu 
545 550 555 560 



Ala Ser Ser Gly Glu Leu Val Glu Pro Arg Val Glu Gin Met Pro Gin 
565 570 575 



Ala Ser Pro Gly Leu Ala Pro Arg Thr Arg Gly Ser Ser Gly Pro Gin 
580 585 590 



Pro Ala Lys Pro Cys Ser Gly Ala Thr Pro Thr Pro Leu Leu Leu Val 
595 600 605 



Gly Asp Arg Ser Pro Val Pro Ser Pro Gly Ser Ser Ser Pro Gin Leu 
610 615 620 



Gin Val Lys Ser Ser Cys Lys Glu Asn Pro Phe Asn Arg Lys Pro Ser 
625 630 635 640 



Pro Ala Ala Ser Pro Ala Thr Lys Lys Ala Thr Lys Gly Ser Lys Pro 
645 650 655 



Val Arg Pro Pro Ala Pro Gly His Gly Phe Pro Leu lie Lys Arg Lys 
660 665 670 



Val Gin Ala Asp Gin Tyr lie Pro Glu Glu Asp He His Gly Glu Met 
675 680 685 



Asp Thr He Glu Arg Arg Leu Asp Ala Leu Glu His Arg Gly Val Leu 
690 695 700 



Leu Glu Glu Lys Leu Arg Gly Gly Leu Asn Glu Gly Arg Glu Asp Asp 
705 710 715 720 
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Met Leu Val Asp Trp Phe Lys Leu lie His Glu Lys His Leu Leu Val 
725 730 735 



Arg Arg Glu Ser Glu Leu lie Tyr Val Phe Lys Gin Gin Asn Leu Glu 
740 745 750 



Gin Arg Gin Ala Asp Val Glu Tyr Glu Leu Arg Cys Leu Leu Asn Lys 
755 760 ~ 765 



Pro Glu Lys Asp Trp Thr Glu Glu Asp Arg Ala Arg Glu Lys Val Leu 
770 775 780 



Met Gin Glu Leu Val Thr Leu lie Glu Gin Arg Asn Ala lie He Asn 
785 790 795 800 



Cys Leu Asp Glu Asp Arg Gin Arg Glu Glu Glu Glu Asp Lys Met Leu 
805 810 815 



Glu Ala Met He Lys Lys Lys Glu Phe Gin Arg Glu Ala Glu Pro Glu 
820 825 830 



Gly Lys Lys Lys Gly Lys Phe Lys Thr Met Lys Met Leu Lys Leu Leu 
835 840 845 



Gly Asn Lys Arg Asp Ala Lys Ser Lys Ser Pro Arg Asp Lys Ser 
850 855 860 



<210> 15 

<211> 2685 

<212> DNA 

<213> Homo sapiens 

<400> 15 



atggcggcca 


tcagggcgct 


gcaacagtgg 


tgccggcagc 


agtgcgaggg 


ctaccgcgac 


60 


gtgaatatct 


gcaacatgac 


cacgtcgttc 


cgcgacggcc 


tggctttctg 


cgccatcctg 


120 


caccgccacc 


ggcccgacct 


cataaacttc 


agtgctctca 


agaaggaaaa 


tatttatgaa 


180 


aacaataaac 


tggccttccg 


cgtggccgag 


gagcacttgg 


gcatcccagc 


cttgctggat 


240- 


gccgaggaca 


tggtggcctt 


gaaggtgcct 


gaccggctga 


gcatcttgac 


ctacgtgtcc 


300 


cagtattaca 


actacttcca 


cggccgctcc 


cccattgggg 


gcatggcagg 


cgtgaagagg 


360 


gcctcggagg 


actctgagga 


ggagccgtca 


gggaagaagg 


ctccagtcca 


ggcggccaag 


420 


ctgccctcgc 


ccgccccagc 


ccggaagcct 


ccactatctc 


cagcccagac 


aaaccctgtg 


480 


gtccagagga 


ggaatgaggg 


tgcagggggc 


ccgcccccca 


agactgacca 


ggcattggcg 


540 
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ggcagcttgg 


tcagcagcac 


ctgcggggtc 


tgcggcaagc 


acgtgcacct 


ggtacagcgg 


600 


cacctggccg acgggaggct 


ttaccaccgg agctgcttca 


ggtgtaagca 


gtgctcctgc 


660 


acgctgcact 


cgggggccta 


caaggccaca 


ggagagccgg 


gcaccttcgt 


ctgcaccagc 


720 


cacctccccg 


cagccgcctc 


tgcaagcccc 


aagttgacgg 


gtctggtccc 


ccgacagcca 


780 


ggggccatgg 


gtgtggattc 


caggacctcc 


tgttccccac 


agaaggccca 


ggaggcaaac 


840 


aaggccagac 


cgttggcctg 


ggagcctgct 


gcgggcaact 


cgcctgccag 


ggcttccgtt 


900 


ccagctgcac 


ccaaccctgc 


agccaccagc 


gccacgtccg 


tccacgtgag 


gagcccagcc 


960 


aggccctctg agagccgcct 


ggcccccact 


cccacggagg 


ggaaagtccg 


ccctcgtgtg 


1020 


accaatagct 


ccccgatggg 


ctggtcgtca 


gctgccccgt 


gcacagcagc 


ggctgcctcc 


1080 


catcccgccg 


tgcccccgag 


tgccccagac 


cctcgcccgg 


ccacacccca 


gggcggggga 


1140 


gccccccgag tggcagctcc 


tcaaaccaca 


ctcagttcaa 


gctccacatc 


tgcagccacg 


1200 


gtggaccccc 


cagcctggac 


cccgtccgcc 


tccaggaccc 


agcaggcccg 


gaataagttt 


1260 


ttccaaacat 


cagcagtgcc 


ccccggcacc 


agcctttctg 


gcagaggtcc 


caccccgtca 


1320 


cttgttctat 


ccaaggacag 


cagcaaggag 


caggcgcgga 


acttcctcaa 


gcaggccctc 


1380 


tcagcgctgg 


aagaggctgg 


cgctccggcg 


cctggcaggc 


cctccccagc 


cactgccgct 


1440 


gttcccagtt 


ctcagcccaa 


aactgaagca 


ccacaagcaa 


gtcccttagc 


caagccgtta 


1500 


cagtcctcgt 


ctccccgggt 


gcttggcctc 


ccttcgagga 


tggaaccgcc 


agccccgctg 


1560 


agcacgagca 


gtacctctca 


ggcatccgcg 


ttgcccccgg 


caggcaggag 


gaacttggct 


1620 


gaatcctcag gggtcggcag 


ggtgggtgct 


ggctccaggc 


cgaagccaga 


ggccccgatg 


1680 


gcaaagggta 


aaagcaccac 


cttaacgcag 


gacatgagca 


ccagcctcca 


ggaaggccag 


1740 


gaggacgggc 


cggcaggatg 


gagagcgaat 


ctgaagcccg 


tggacaggag 


gagcccagct 


1800 


gagaggactc 


tgaagcccaa 


ggaaccacgg gccctggcag 


agccgagggc 


gggggaggcc 


1860 


cccaggaagg 


tctcaggcag 


ctttgctggg 


agtgtccaca 


tcaccctgac 


ccccgtgagg 


1920 


cctgacagga 


ccccacgccc 


agccagccca 


ggacccagcc 


tcccagccag 


gtccccctcc 


1980 


ccaccccgcc 


gcaggagact 


ggccgtccct 


gccagcctcg 


acgtttgtga 


caactggctt 


2040 


cggccggagc 


cccctggcca 


ggaagcccga 


gtgcagagct 






2100 


cctcaccttc 


agggcagacc 


agggagaccc 


ttgtccccgg 


ccaatgtccc 


tgctctgcct 


2160 


ggcgagacgg tgacctcccc 


agtcaggctg 


caccccgact 


acctctcccc 


ggaggagata 


2220 


cagaggcagc 


tgcaggacat 


cgagaggcgg 


ctggacgccc 


tggagctccg 


cggcgtggag 


2280 


ctggagaagc gactgcgggc 


ggccgaggga 


gatgacgctg 


aggatagcct 


catggtggac 


2340 
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tggttctggc 


tcattcacga 


gaagcagctt 


ctgctgagac 


aggagtcaga 


gctgatgtac 


2400 


aagtccaagg 


cccagcgtct 


ggaggagcag 


cagctggaca 


tcgagggcga 


gctgcgccgg 


2460 


ctcatggcca 


agcccgaggc 


tctgaagtca 


ctgcaggagc 


ggcggcggga 


gcaggagctg 


2520 


ctggagcagt 


acgtgagcac 


cgtgaacgac 


cgcagtgaca 


tcgtggactc 


gctggacgag 


2580 


gaccggctcc 


gggaacaaga 


ggaggatcag 


atgctgcggg 


acatgattga 


gaagctgggc 


2640 


ctccagagga 


agaagtccaa 


gttccgcttg 


tccaagatct 


ggtca 




2685 



<210> 16 

<211> 904 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Ala Ala He Arg Ala Leu Gin Gin Trp Cys Arg Gin Gin Cys Glu 
15 10 15 



Gly Tyr Arg Asp Val Asn He Cys Asn Met Thr Thr Ser Phe Arg Asp 
20 25 3 0 



Gly Leu Ala Phe Cys Ala He Leu His Arg His Arg Pro Asp Leu He 
3 5 40 45 



Asn Phe Ser Ala Leu Lys Lys Glu Asn He Tyr Glu Asn Asn Lys Leu 
50 55 60 



Ala Phe Arg Val Ala Glu Glu His Leu Gly He Pro Ala Leu Leu Asp 
65 7 0 75 8 0 



Ala Glu Asp Met Val Ala Leu Lys Val Pro Asp Arg Leu Ser He Leu 
85 90 95 



Thr Tyr Val Ser Gin Tyr Tyr Asn Tyr Phe His Gly Arg Ser Pro He 
100 105 110 



Gly Gly Met Ala Gly Val Lys Arg Ala Ser Glu Asp Ser Glu Glu Glu 
115 12 0 125 



Pro Ser Gly Lys Lys Ala Pro Val Gin Ala Ala Lys Leu Pro Ser Pro 
13 0 135 14 0 



Ala Pro Ala Arg Lys Pro Pro Leu Ser Pro Ala Gin Thr Asn Pro Val 
145 " ~ 150 155 160 



Val Gin Arg Arg Asn Glu Gly Ala Gly Gly Pro Pro Pro Lys Thr Asp 
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165 170 175 



Gin Ala Leu Ala Gly Ser Leu Val Ser Ser Thr Cys Gly Val Cys Gly 
180 185 190 



Lys His Val His Leu Val Gin Arg His Leu Ala Asp Gly Arg Leu Tyr 
195 200 205 



His Arg Ser Cys Phe Arg Cys Lys Gin Cys Ser Cys Thr Leu His Ser 
210 215 220 



Gly Ala Tyr Lys Ala Thr Gly Glu Pro Gly Thr Phe Val Cys Thr Ser 
225 230 235 240 



His Leu Pro Ala Ala Ala Ser Ala Ser Pro Lys Leu Thr Gly Leu Val 
245 250 255 



Pro Arg Gin Pro Gly Ala Met Gly Val Asp Ser Arg Thr Ser Cys Ser 
260 265 270 



Pro Gin Lys Ala Gin Glu Ala Asn Lys Ala Arg Pro Leu Ala Trp Glu 
275 280 285 



Pro Ala Ala Gly Asn Ser Pro Ala Arg Ala Ser Val Pro Ala Ala Pro 
290 295 300 



Asn Pro Ala Ala Thr Ser Ala Thr Ser Val His Val Arg Ser Pro Ala 
305 310 315 320 



Arg Pro Ser Glu Ser Arg Leu Ala Pro Thr Pro Thr Glu Gly Lys Val 
325 330 335 



Arg Pro Arg Val Thr Asn Ser Ser Pro Met Gly Trp Ser Ser Ala Ala 
340 345 350 



Pro Cys Thr Ala Ala Ala Ala Ser His Pro Ala Val Pro Pro Ser Ala 
355 360 365 



Pro Asp Pro Arg Pro Ala Thr Pro Gin Gly Gly Gly Ala Pro Arg Val 
370 375 380 



Ala Ala Pro Gin Thr Thr Leu Ser Ser Ser Ser Thr Ser Ala Ala Thr 
385 390 395 400 



Val Asp Pro Pro Ala Trp Thr Pro Ser Ala Ser Arg Thr Gin Gin Ala 
405 410 415 
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Arg Asn Lys Phe Phe Gin Thr Ser Ala Val Pro Pro Gly Thr Ser Leu 
420 425 430 



Ser Gly Arg Gly Pro Thr Pro Ser Leu Val Leu Ser Lys Asp Ser Ser 
435 440 445 



Lys Glu Gin Ala Arg Asn Phe Leu Lys Gin Ala Leu Ser Ala Leu Glu 
450 ' 455 460 



Glu Ala Gly Ala Pro Ala Pro Gly Arg Pro Ser Pro Ala Thr Ala Ala 
465 470 475 480 



Val Pro Ser Ser Gin Pro Lys Thr Glu Ala Pro Gin Ala Ser Pro Leu 
485 490 495 



Ala Lys Pro Leu Gin Ser Ser Ser Pro Arg Val Leu Gly Leu Pro Ser 
500 505 " 510 



Arg Met Glu Pro Pro Ala Pro Leu Ser Thr Ser Ser Thr Ser Gin Ala 
515 520 525 



Ser Ala Leu Pro Pro Ala Gly Arg Arg Asn Leu Ala Glu Ser Ser Gly 
530 535 540 



Val Gly Arg Val Gly Ala Gly Ser Arg Pro Lys Pro Glu Ala Pro Met 
545 550 555 560 



Ala Lys Gly Lys Ser Thr Thr Leu Thr Gin Asp Met Ser Thr Ser Leu 
565 570 575 



Gin Glu Gly Gin Glu Asp Gly Pro Ala Gly Trp Arg Ala Asn Leu Lys 
580 585 590 



Pro Val Asp Arg Arg Ser Pro Ala Glu Arg Thr Leu Lys Pro Lys Glu 
595 600 605 



Pro Arg Ala Leu Ala Glu Pro Arg Ala Gly Glu Ala Pro Arg Lys Val 
610 615 620 



Ser Gly Ser Phe Ala Gly Ser Val His lie Thr Leu Thr Pro Val Arg 
625 630 635 640 



Pro Asp Arg Thr Pro Arg Pro Ala Ser Pro Gly Pro Ser Leu Pro Ala 
645 650 655 



WO 03/066821 



105/161 



PCT/US03/03551 



Arg Ser Pro Ser Pro Pro Arg Arg Arg Arg Leu Ala Val Pro Ala Ser 
660 665 670 



Leu Asp Val Cys Asp Asn Trp Leu Arg Pro Glu Pro Pro Gly Gin Glu 
675 680 685 



Ala Arg Val Gin Ser Trp Lys Glu Glu Glu Lys Lys Pro His Leu Gin 
690 695 700 



Gly Arg Pro Gly Arg Pro Leu Ser Pro Ala Asn Val Pro Ala Leu Pro 
705 710 715 720 



Gly Glu Thr Val Thr Ser Pro Val Arg Leu His Pro Asp Tyr Leu Ser 
725 730 735 



Pro Glu Glu lie Gin Arg Gin Leu Gin Asp lie Glu Arg Arg Leu Asp 
740 745 750 



Ala Leu Glu Leu Arg Gly Val Glu Leu Glu Lys Arg Leu Arg Ala Ala 
755 760 765 



Glu Gly Asp Asp Ala Glu Asp Ser Leu Met Val Asp Trp Phe Trp Leu 
770 775 780 



lie His Glu Lys Gin Leu Leu Leu Arg Gin Glu Ser Glu Leu Met Tyr 
785 790 795 800 



Lys Ser Lys Ala Gin Arg Leu Glu Glu Gin Gin Leu Asp lie Glu Gly 
805 810 815 



Glu Leu Arg Arg Leu Met Ala Lys Pro Glu Ala Leu Lys Ser Leu Gin 
820 825 830 



Glu Arg Arg Arg Glu Gin Glu Leu Leu Glu Gin Tyr Val Ser Thr Val 
835 840 845 



Asn Asp Arg Ser Asp lie Val Asp Ser Leu Asp Glu Asp Arg Leu Arg 
850 855 860 



Glu Gin Glu Glu Asp Gin Met Leu Arg Asp Met lie Glu Lys Leu Gly 
865 870 875 880 



Leu Gin Arg Lys Lys Ser Lys Phe Arg Leu Ser Lys lie Trp Ser Pro 
885 890 895 
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Lys Ser Lys Ser Ser Pro Ser Gin 
900 

<210> 17 
<211> 3351 
<212> DNA 
<213> Drosophila 

<400> 17 



caccgttttg 


ccttgtttac 


atttaatgta 


atttgtacca 


gataaatctt 


atttgatttt 


60 


actcaagtca 


agtggcacga 


tgtgcttttg 


ttgttcactt 


agtttacagt 


ttgtgactgc 


120 


tattttcgtt 


tagttatatt 


cgccaataag 


ccagcaatta 


acccactaaa 


atgtctgatc 


180 


gtcgtggcac 


aaaagttgga 


actggtacga 


aggctttgga 


gtattggtgc 


cgagttgtga 


240 


cccaaggata 


taatggggtc 


aaggtggaga 


acatgaccac 


ttcctggcga 


aatggacttg 


300 


ccttctgcgc 


cataatacac 


cactttcggc 


cggatcttat 


agatttcgac 


cgacttaaag 


360 


cagacgatat 


ctatgagaac 


aacgatttgg 


cctttacaac 


ggccgagaaa 


tatttgggaa 


420 


ttcccgcact 


gctcgatgca 


gctgacatgg 


tttcgtatga 


agtacctgat 


agactctcca 


480 


tactaacgta 


tttatcccag 


ttttacaagg 


tgctcggcaa 


gagcctgaag 


cacccgaagc 


540 


cagaggagcc 


acttggcgaa 


gagagtgagc 


caccgcaaaa 


ggtgatgcac 


attgttggga 


600 


tgcctcgacg 


tgataagtgc 


cagaaatgca 


accttcctgt 


ttttctcgcc 


gaaagggtcc 


660 


ttgtcggcaa 


acgggcttat 


catcgcacat 


gcttaaaatg 


tgcaagatgt 


tcgtcattgc 


720 


tcacacccgg 


gagtttttat 


gaaacggagg 


tcaataacat 


atactgttgc 


gagacttgtc 


780 


ccgatgaaga 


aagtgaaccg 


gaatctgaca 


ttttaaagtt 


aaagacaacc 


actactgatt 


840 


ctccgaatga 


taaacaaatg 


gtggcacaaa 


gttctgatta 


ctctgaagct 


gaagataaac 


900 


aagaagacct 


ggaagataat 


gatatacgta 


ctactgataa 


gcctgaaaac 


ttccaaccgc 


960 


cgtccaacaa 


agatgaacaa 


aataatgaac 


taactattaa 


tccggtaaac 


cctatattat 


1020 


cagaggagcg 


taaaatctct 


tttattccac 


tagacgaaga 


agatggtggc 


cttattgaac 


1080 


aatataataa 


atcaacaact 


cctgtaaagc 


ccgctatacc 


agagaaacca 


aaagtctcta 


1140 


ctcttccact 


agacgatgaa 


caacatgctg 


gcgttgaaca 


aaacaatgat 


ctggcggtta 


1200 


gtccggaaaa 


tgacattcca 


aaagagaaac 


ttaagatttc 


ttcggtctca 


atatatcttg 


1260 


aagatgaccg 


ccttgttgta 


gatgcaatac 


atccggataa 


tttagataaa 


caagaagctt 


1320 


tgaacaatac 


atcagacgca 


ctcattccag 


aatcacagga 


agcaccgatt 


cctgaaaata 


1380 


acactcaagt 


agccattaaa 


ccagaagacc 


atataagccc 


gcgtaaagaa 


aataaaattt 


1440 


tctcaaacac 


agaaagctgc 


tctaagcagg 


aaggtgttct 


cccaaaacaa 


atggatctcg 


1500 


agtctcccaa 


ggacaaggta 


attgagacaa 


aagcatctga 


aactgactat 


ccagaggatc 


1560 
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taaatccctt taaagatgat 


gactcatcta 


aaggtgccaa 


cccattcgac 


agctctgatg 


1620 


atgaagtgga gcttcttaaa 


gccattccag 


cccaacaaag 


caaaggaaaa 


gttgtaccgc 


1680 


cccgtccacc gccacctaaa 


attggccttt 


catccatctc 


caatccgtca 


gagaaaccac 


1740 


attccagccc tacactttcg 


catgggaaaa 


aaatgccgat 


gccaacaccg 


agaatatcta 


1800 


tatcaaaaac ccagactcca 


gcaaaaccaa 


tgacgcatca 


aggccagaaa 


agtagtattt 


1860 


cctcttcgtc ttcggagcat 


ttgaatagca 


taagaacgtt 


cgatcgcgga 


gcagatgacc 


1920 


gtggctcaag catctccttg 


ccatctgcaa 


atgggcctcg 


caagccactt 


cgcgcctcag 


1980 


tggggagtcc gctgcgaagc 


gaagaatcga 


gtcccactac 


cagtcttagc 


tctattacct 


2040 


ctccgatgcg gaagaagcgc 


caggcacctt 


tgcccccaat 


acaaacggac 


tttgacagtg 


2100 


atcctggatt ttcaaaattg 


tccgacgaac 


aaaaggcatt 


gttgcacact 


cagcttaagg 


2160 


ccccaaatct gggtgattca 


accagaagac 


taataccact 


tgaccagagt 


ctgctatccg 


2220 


atgaggcaac cgagtcgagt 


aattacgacg 


agagcttaag 


tacatccaat 


gccgacgagg 


2280 


aggtaaatgt ggtataccga 


cgcattttgg 


tgccgccaac 


tcaacccgaa 


aacactgttg 


2340 


aaegtagcaa ggaggatcaa 


aaatcgccta 


tcgtgtataa 


cgacttcgat 


agaaacgtaa 


2400 


gcccattggg gcacaataaa 


tccactcatg 


ggaaatggaa 


gaggcgaaag 


ggaccagcac 


2460 


cagcggttcc aataccaccg 


cgcaaagtct 


tacaaaggct 


tccgctgcaa 


gaaattcgcc 


2520 


acgagttcga aattattgcg 


gtacagcagc 


tgggtcttga 


gaaacaaggc 


gttattctgg 


2580 


agaaaatgat cagggatcgc 


tgtgagcgtt 


ccttagatgc 


caccgatact 


gatggtcccg 


2640 


aatccgcaga agtgctaacc 


aactcaaagg 


aagtggaaga 


tcttatattg 


caactgttcg 


2700 


agctggtcaa cgagaaaaac 


gaattgttcc 


gcagacaagc 


ggaactgatg 


taccttcgac 


2760 


gtcagcatcg cctggagcag 


gagcaggcgg 


acatagagca 


tgagatacga 


gtattaatgg 


2820 


ggcaaccgga gcacaacaaa 


accgattcgg 


ataaagccca 


tgaggaagta 


ttaatcaatc 


2880 


gccttgtgaa ggtggttgaa 


atgcgtaacg 


aagtaattga 


tagcctagag 


actgaccgag 


2940 


ttcgcgaggc gcgagaggat 


atgagcatca 


agaatcggct 


tcatatatac 


aattctgagc 


3000 


gcgaggaacc accagcacat 


ccgagaagcg 


ctgacaaatc 


atccaaaaag 


ctgtctaaaa 


3060 


aggaacgaaa aaaacttaag 


gaggaa a a t a 


agctgggcaa 




i~ caa?} npt* ca 

l — ^— v-j cu. V — V. — l_ V_ V— j 


3 12 0 


acaaggacgt cgatgagtcc 


gaacaggcac 


cagctttgga 


aaaggttaaa 


aagaagcgaa 


3180 


acttgttctt tttgaaaatg 


tagttggggg 


ctggttgagt 


tgaagtgttt 


acgcatcgaa 


3240 


caattgaaat ttattttata 


taaatttgta 


acctattaag 


tttctattgc 


catatgtgag 


3300 


catttcaata aataaagtat 


atattttcct 


ttaaaaaaaa 


aaaaaaaaaa 


a 


3351 
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<210> 18 

<211> 1010 

<212> PRT 

<213> Drosophila 

<400> 18 

Met Ser Asp Arg Arg Gly Thr Lys Val Gly Thr Gly Thr Lys Ala Leu 
1 5 10 15 



Glu Tyr Trp Cys Arg Val Val Thr Gin Gly Tyr Asn Gly Val Lys Val 
20 25 30 



Glu Asn Met Thr Thr Ser Trp Arg Asn Gly Leu Ala Phe Cys Ala lie 
35 40 45 



lie His His Phe Arg Pro Asp Leu lie Asp Phe Asp Arg Leu Lys Ala 
50 55 60 



Asp Asp lie Tyr Glu Asn Asn Asp Leu Ala Phe Thr Thr Ala Glu Lys 
65 70 75 80 



Tyr Leu Gly He Pro Ala Leu Leu Asp Ala Ala Asp Met Val Ser Tyr 
85 90 95 



Glu Val Pro Asp Arg Leu Ser He Leu Thr Tyr Leu Ser Gin Phe Tyr 
100 105 110 



Lys Val Leu Gly Lys Ser Leu Lys His Pro Lys Pro Glu Glu Pro Leu 
115 120 125 



Gly Glu Glu Ser Glu Pro Pro Gin Lys Val Met His He Val Gly Met 
130 135 140 



Pro Arg Arg Asp Lys Cys Gin Lys Cys Asn Leu Pro Val Phe Leu Ala 
145 150 155 160 



Glu Arg Val Leu Val Gly Lys Arg Ala Tyr His Arg Thr Cys Leu Lys 
165 170 175 



Cys Ala Arg Cys Ser Ser Leu Leu Thr Pro Gly Ser Phe Tyr Glu Thr 
180 185 190 



Glu Val Asn Asn He Tyr Cys Cys Glu Thr Cys Pro Asp Glu Glu Ser 
195 200 205 



Glu Pro Glu Ser Asp He Leu Lys Leu Lys Thr Thr Thr Thr Asp Ser 
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210 215 220 



Pro Asn Asp Lys Gin Met Val Ala Gin Ser Ser Asp Tyr Ser Glu Ala 
225 230 235 240 



Glu Asp Lys Gin Glu Asp Leu Glu Asp Asn Asp lie Arg Thr Thr Asp 
245 250 255 



Lys Pro Glu Asn Phe Gin Pro Pro Ser Asn Lys Asp Glu Gin Asn Asn 
260 265 270 



Glu Leu Thr lie Asn Pro Val Asn Pro lie Leu Ser Glu Glu Arg Lys 
275 280 285 



lie Ser Phe He Pro Leu Asp Glu Glu Asp Gly Gly Leu He Glu Gin 
290 295 300 



Tyr Asn Lys Ser Thr Thr Pro Val Lys Pro Ala He Pro Glu Lys Pro 
305 310 315 320 



Lys Val Ser Thr Leu Pro Leu Asp Asp Glu Gin His Ala Gly Val Glu 
325 330 335 



Gin Asn Asn Asp Leu Ala Val Ser Pro Glu Asn Asp He Pro Lys Glu 
340 345 350 



Lys Leu Lys He Ser Ser Val Ser He Tyr Leu Glu Asp Asp Arg Leu 
355 360 365 



Val Val Asp Ala He His Pro Asp Asn Leu Asp Lys Gin Glu Ala Leu 
370 375 380 



Asn Asn Thr Ser Asp Ala Leu He Pro Glu Ser Gin Glu Ala Pro He 
385 390 395 400 



Pro Glu Asn Asn Thr Gin Val Ala He Lys Pro Glu Asp His He Ser 
405 410 415 



Pro Arg Lys Glu Asn Lys He Phe Ser Asn Thr Glu Ser Cys Ser Lys 
420 425 430 



Gin Glu Gly Val Leu Pro Lys Gin Met Asp Leu Glu Ser Pro Lys Asp 
435 440 445 



Lys Val He Glu Thr Lys Ala Ser Glu Thr Asp Tyr Pro Glu Asp Leu 
450 455 460 
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Asn Pro Phe Lys Asp Asp Asp Ser Ser Lys Gly Ala Asn Pro Phe Asp 
465 470 475 480 



Ser Ser Asp Asp Glu Val Glu Leu Leu Lys Ala lie Pro Ala Gin Gin 
485 490 495 



Ser Lys Gly Lys Val Val Pro Pro Arg Pro Pro Pro Pro Lys lie Gly 
500 505 510 



Leu Ser Ser lie Ser Asn Pro Ser Glu Lys Pro His Ser Ser Pro Thr 
515 520 525 



Leu Ser His Gly Lys Lys Met Pro Met Pro Thr Pro Arg lie Ser lie 
530 535 540 



Ser Lys Thr Gin Thr Pro Ala Lys Pro Met Thr His Gin Gly Gin Lys 
545 550 555 560 



Ser Ser lie Ser Ser Ser Ser Ser Glu His Leu Asn Ser lie Arg Thr 
565 570 575 



Phe Asp Arg Gly Ala Asp Asp Arg Gly Ser Ser lie Ser Leu Pro Ser 
580 585 590 



Ala Asn Gly Pro Arg Lys Pro Leu Arg Ala Ser Val Gly Ser Pro Leu 
595 600 605 



Arg Ser Glu Glu Ser Ser Pro Thr Thr Ser Leu Ser Ser lie Thr Ser 
610 615 620 



Pro Met Arg Lys Lys Arg Gin Ala Pro Leu Pro Pro lie Gin Thr Asp 
625 630 635 640 



Phe Asp Ser Asp Pro Gly Phe Ser Lys Leu Ser Asp Glu Gin Lys Ala 
645 650 655 



Leu Leu His Thr Gin Leu Lys Ala Pro Asn Leu Gly Asp Ser Thr Arg 
660 665 670 



Arg Leu lie Pro Leu Asp Gin Ser Leu Leu Ser Asp Glu Ala Thr Glu 
675 680 685 



Ser Ser Asn Tyr Asp Glu Ser Leu Ser Thr Ser Asn Ala Asp Glu Glu 
690 695 700 
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Val Asn Val Val Tyr Arg Arg lie Leu Val Pro Pro Thr Gin Pro Glu 
705 710 715 720 



Asn Thr Val Glu Arg Ser Lys Glu Asp Gin Lys Ser Pro lie Val Tyr 
725 730 735 



Asn Asp Phe Asp Arg Asn Val Ser Pro Leu Gly His Asn Lys Ser Thr 
740 745 750 



His Gly Lys Trp Lys Arg Arg Lys Gly Pro Ala Pro Ala Val Pro lie 
755 760 765 



Pro Pro Arg Lys Val Leu Gin Arg Leu Pro Leu Gin Glu lie Arg His 
770 775 780 



Glu Phe Glu lie He Ala Val Gin Gin Leu Gly Leu Glu Lys Gin Gly 
785 790 795 800 



Val He Leu Glu Lys Met He Arg Asp Arg Cys Glu Arg Ser Leu Asp 
805 810 815 



Ala Thr Asp Thr Asp Gly Pro Glu Ser Ala Glu Val Leu Thr Asn Ser 
820 825 830 



Lys Glu Val Glu Asp Leu He Leu Gin Leu Phe Glu Leu Val Asn Glu 
835 840 845 



Lys Asn Glu Leu Phe Arg Arg Gin Ala Glu Leu Met Tyr Leu Arg Arg 
850 855 860 



Gin His Arg Leu Glu Gin Glu Gin Ala Asp He Glu His Glu He Arg 
865 870 875 880 



Val Leu Met Gly Gin Pro Glu His Asn Lys Thr Asp Ser Asp Lys Ala 
885 890 895 



His Glu Glu Val Leu He Asn Arg Leu Val Lys Val Val Glu Met Arg 
900 905 910 



Asn Glu Val He Asp Ser Leu Glu Thr Asp Arg Val Arg Glu Ala Arg 
915 920 925 



Glu Asp Met Ser He Lys Asn Arg Leu His He Tyr Asn Ser Glu Arg 
930 935 940 
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Glu Glu Pro Pro Ala His Pro Arg Ser Ala Asp Lys Ser Ser Lys Lys 
945 950 955 960 



Leu Ser Lys Lys Glu Arg Lys Lys Leu Lys Glu Glu Asn Lys Leu Gly 
965 970 975 



Lys Gly Lys Lys Ser Asp Leu Asp Lys Asp Val Asp Glu Ser Glu Gin 
980 985 990 



Ala Pro Ala Leu Glu Lys Val Lys Lys Lys Arg Asn Leu Phe Phe Leu 
995 1000 1005 



Lys Met 
1010 



<210> 


19 




<211> 


202 




<212> 


PRT 




<213> 


Artificial 


sequence 


<220> 






<223> 


Drosophila 


truncated 


<400> 


19 





Met Asn Tyr Gin Arg Ser Asp Asp Glu Ser Tyr Ala Asn Glu Thr Arg 
15 10 15 



Glu His Lys Lys Gin Arg Ala lie Ser Lys Ala Ser Arg Gin Ala Glu 
20 25 30 



Leu Lys Arg Leu Arg lie Ala Gin Glu lie Gin Arg Glu Gin Glu Glu 
35 40 45 



He Glu Val Gin Leu Lys Asp Leu Glu Ala Arg Gly Val Leu He Glu 
50 55 60 



Lys Ala Leu Arg Gly Glu Ala Gin Asn He Glu Asn Leu Asp Ala Thr 
65 70 75 80 



Lys Asp Asn Asp Glu Lys Leu Leu Lys Glu Leu Leu Glu He Trp Arg 
85 90 95 



Asn He Thr Ala Leu Lys Lys Arg Asp Glu Glu Leu Thr He Arg Gin 
100 105 iio 



Gin Glu Leu Gin Leu Glu Tyr Arg His Ala Gin Leu Lys Glu Glu Leu 
115 120 125 
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Asn Leu Arg Leu Ser Cys Asn Lys Leu Asp Lys Ser Ser Ala Asp Val 
13 0 135 140 

Ala Ala Glu Gly Ala He Leu Asn Glu Met Leu Glu He Val Ala Lys 
145 150 155 160 

Arg Ala Ala Leu Arg Pro Thr Ala Ser Gin Leu Asp Leu Thr- Ala Ala 
165 170 175 

Gly Ser Ala Ser Thr Ser Ala Glu Ala Thr Gly He Lys Leu Thr Gly 
180 185 ~ 190 

Gin Pro His Asp His Glu Glu Ser He He 
195 200 

<210> 20 

<211> 3002 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Drosophila G-W mutant. G residues 134, 136, 139 of Drosophila MI 
CAL changed to W residues 

<400> 20 

Met Ser Arg Gin His Gin Arg His His Gin Gin His His His Leu Pro 
1 5 io 15 

Pro His Gin Gin Pro Gin Gin Gin Met Pro Gin Gin Gin Gin Gin Leu 
20 25 30 

Thr Ala Gin Gin Gin Gin Gin Gin Gin Leu Leu Met Ala Glu His Ala 
35 , 40 45 

Ala Ala Ala Glu Ala Ala Glu Leu Phe Asp Leu Leu Cys Val Ala Thr 
50 55 60 

Thr Met Arg Gin He Leu Ala Leu His Arg Ala Met Cys Glu Ala Val 
65 7 0 75 8 0 

Gly Leu Arg Pro Ser Pro Leu Asn Asp Phe Tyr Pro Arg Leu Lys Ala 
85 90 " 95 

Lys Val Arg Ser Trp Lys Ala Gin Ala Leu Trp Lys Lys Phe Asp Ala 
100 105 110 



Arg Ala Ala His Arg Val Tyr Gly Lys Gly Ala Ala Cys Thr Gly Thr 
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115 120 125 



Arg Val Leu Val lie Trp Ala Trp Pro Cys Trp Leu Arg Thr Ala He 
130 135 140 



Glu Ala Gin Leu Leu Gly Ala Lys Val Val Val Leu Glu Lys Arg Asp 
145 150 155 160 



Arg He Thr Arg Asn Asn Val Leu His Leu Trp Pro Phe Val He Thr 
165 170 175 



Asp Leu Arg Asn Leu Gly Ala Lys Lys Phe Tyr Gly Lys Phe Cys Ala 
180 185 190 



Gly Ser He Asp His He Ser He Arg Gin Leu Gin Cys Met Leu Leu 
195 200 205 



Lys Val Ala Leu Leu Leu Gly Val Glu He His Glu Gly Val Ser Phe 
210 215 220 



Asp His Ala Val Glu Pro Ser Gly Asp Gly Gly Gly Trp Arg Ala Ala 
225 230 235 240 



Val Thr Pro Ala Asp His Pro Val Ser His Tyr Glu Phe Asp Val Leu 
245 250 255 



He Gly Ala Asp Gly Lys Arg Asn Met Leu Asp Phe Arg Arg Lys Glu 
260 265 270 



Phe Arg Gly Lys Leu Ala He Ala He Thr Ala Asn Phe He Asn Lys 
275 280 285 



Lys Thr Glu Ala Glu Ala Lys Val Glu Glu He Ser Gly Val Ala Phe 
290 295 300 



He Phe Asn Gin Ala Phe Phe Lys Glu Leu Tyr Gly Lys Thr Gly He 
305 310 315 320 



Asp Leu Glu Asn He Val Tyr Tyr Lys Asp Glu Thr His Tyr Phe Val 
325 330 335 



Met Thr Ala Lys Lys His Ser Leu He Asp Lys Gly Val He He Glu 
340 345 350 



Asp Met Ala Asp Pro Gly Glu Leu Leu Ala Pro Ala Asn Val Asp Thr 
355 360 365 
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Gin Lys Leu His Asp Tyr Ala Arg Glu Ala Ala Glu Phe Ser Thr Gin 
370 375 380 



Tyr Gin Met Pro Asn Leu Glu Phe Ala Val Asn His Tyr Gly Lys Pro 
385 390 395 400 



Asp Val Ala Met Phe Asp Phe Thr Ser Met Phe Ala Ala Glu Met Ser 
405 410 415 



Cys Arg Val lie Val Arg Lys Gly Ala Arg Leu Met Gin Cys Leu Val 
420 425 430 



Gly Asp Ser Leu Leu Glu Pro Phe Trp Pro Thr Gly Ser Gly Cys Ala 
435 440 445 



Arg Gly Phe Leu Ser Ser Met Asp Ala Ala Tyr Ala lie Lys Leu Trp 
450 455 460 



Ser Asn Pro Gin Asn Ser Thr Leu Gly Val Leu Ala Gin Arg Glu Ser 
465 470 475 480 



He Tyr Arg Leu Leu Asn Gin Thr Thr Pro Asp Thr Leu Gin Arg Asp 
485 490 495 



He Ser Ala Tyr Thr Val Asp Pro Ala Thr Arg Tyr Pro Asn Leu Asn 
500 505 510 



Arg Glu Ser Val Asn Ser Trp Gin Val Lys His Leu Val Asp Thr Asp 
515 520 525 



Asp Pro Ser He Leu Glu Gin Thr Phe Met Asp Thr His Ala Leu Gin 
530 535 540 



Thr Pro His Leu Asp Thr Pro Gly Arg Arg Lys Arg Arg Ser Gly Asp 
545 550 555 560 



Leu Leu Pro Gin Gly Ala Thr Leu Leu Arg Trp He Ser Ala Gin Leu 
565 570 575 



His Ser Tyr Gin Phe He Pro Glu Leu Lys Glu Ala Ser Asp Val Phe 
580 585 590 



Arg Asn Gly Arg Val Leu Cys Ala Leu He Asn Arg Tyr Arg Pro Asp 
595 600 605 
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Leu lie Asp Tyr Ala Ala Thr Lys Asp Met Ser Pro Val Glu Cys Asn 
610 615 620 



Glu Leu Ser Phe Ala Val Leu Glu Arg Glu Leu His lie Asp Arg Val 
625 630 635 640 



Met Ser Ala Lys Gin Ser Leu Asp Leu Thr Glu Leu Glu Ser Arg lie 
645 650 655 



Trp Leu Asn Tyr Leu Asp Gin lie Cys Asp Leu Phe Arg Gly Glu lie 
660 665 670 



Pro His lie Lys His Pro Lys Met Asp Phe Ser Asp Leu Arg Gin Lys 
675 680 685 



Tyr Arg lie Asn His Thr His Ala Gin Pro Asp Phe Ser Lys Leu Leu 
690 695 700 



Ala Thr Lys Pro Lys Ala Lys Ser Pro Met Gin Asp Ala Val Asp He 
705 710 715 720 



Pro Thr Thr Val Gin Arg Arg Ser Val Leu Glu Glu Glu Arg Ala Lys 
725 730 735 



Arg Gin Arg Arg His Glu Gin Leu Leu Asn He Gly Gly Gly Ala Ala 
740 745 750 



Gly Ala Ala Ala Gly Val Ala Gly Ser Gly Thr Gly Thr Thr Thr Gin 
755 760 765 



Gly Gin Asn Asp Thr Pro Arg Arg Ser Lys Lys Arg Arg Gin Val Asp 
770 775 780 



Lys Thr Ala Asn He Glu Glu Arg Gin Gin Arg Leu Gin Glu He Glu 
785 790 795 800 



Glu Asn Arg Gin Glu Arg Met Ser Lys Arg Arg Gin Gin Arg Cys His 
805 810 815 



Gin Thr Gin Asn Phe Tyr Lys Ser Leu Gin Leu Leu Gin Ala Gly Lys 
820 825 830 



Leu Leu Arg Glu Gly Gly Glu Ala Gly Val Ala Glu Asp Gly Thr Pro 
835 840 845 
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Phe Glu Asp Tyr Ser lie Phe Leu Tyr Arg Gin Gin Ala Pro Val Phe 
850 855 860 



Asn Asp Arg Val Lys Asp Leu Glu Arg Lys Leu Leu Phe Pro Asp Arg 
865 870 875 880 



Glu Arg Gly Asp lie Pro Ser Ala Leu Pro Arg Thr Ala Asp Glu Gin 
885 890 895 



Phe Ser Asp Arg lie Lys Asn Met Glu Gin Arg Met Thr Gly Arg Gly 
900 905 910 



Gly Leu Gly Gly Asp Lys Lys Pro Lys Asp Leu Met Arg Ala lie Gly 
915 920 925 



Lys lie Asp Ser Asn Asp Trp Asn Val Arg Glu lie Glu Lys Lys lie 
930 935 940 



Glu Leu Ser Lys Lys Thr Glu lie His Gly Pro Lys Gly Arg Glu Lys 
945 950 955 960 



Val Pro Lys Trp Ser Lys Glu Gin Phe Gin Ala Arg Gin His Lys Met 
965 970 975 



Ser Lys Pro Gin Arg Gin Asp Ser Arg Glu Ala Glu Lys Phe Lys Asp 
980 985 990 



lie Asp Gin Thr lie Arg Asn Leu Asp Lys Gin Leu Lys Glu Gly His 
995 1000 1005 



Asn Leu Asp Val Gly Glu Arg Gly Arg Asn Lys Val Ala Ser lie 
1010 1015 1020 



Ala Gly Gin Phe Gly Lys Lys Asp Glu Ala Asn Ser Asp Glu Lys 
1025 1030 1035 



Asn Ala Gly Ser Ser Asn Ala Thr Thr Asn Thr Asn Asn Thr Val 
1040 1045 1050 



He Pro Lys Ser Ser Ser Lys Val Ala Leu Ala Phe Lys Lys Gin 
1055 1060 1065 



Ala Ala Ser Glu Lys Cys Arg Phe Cys Lys Gin Thr Val Tyr Leu 
1070 1075 1080 



Met Glu Lys Thr Thr Val Glu Gly Leu Val Leu His Arg Asn Cys 
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1085 1090 1095 

Leu Lys Cys His His Cys His Thr Asn Leu Arg Leu Gly Gly Tyr 
1100 iio5 mo 

Ala Phe Asp Arg Asp Asp Pro Gin Gly Arg Phe Tyr Cys Thr Gin 
1115 1120 1125 

His Phe Arg Leu Pro Pro Lys Pro Leu Pro Gin Arg Thr Asn Lys 
1130 1135 1140 

Ala Arg Lys Ser Ala Ala Ala Gin Pro Ala Ser Pro Ala Val Pro 
1145 1150 H55 

Pro Thr Ala Gly Ser Val Pro Thr Ala Ala Ala Thr Ser Glu His 
H60 1165 1170 

Met Asp Thr Thr Pro Pro Arg Asp Gin Val Asp Leu Leu Gin Thr 
1175 1180 1185 

Ser Arg Ala Asn Ala Ser Ala Asp Ala Met Ser Asp Asp Glu Ala 
1190 1195 1200 

Asn Val lie Asp Glu His Glu Trp Ser Gly Arg Asn Phe Leu Pro 
1205 1210 1215 

Glu Ser Asn Asn Asp Ser Gin Ser Glu Leu Ser Ser Ser Asp Glu 
1220 1225 1230 

Ser Asp Thr Glu Ser Asp Ser Glu Met Phe Glu Glu Ala Asp Asp 
1235 1240 1245 

Ser Pro Phe Gly Ala Gin Thr Leu Gin Leu Ala Ser Asp Trp lie 
1250 1255 1260 

Gly Lys Gin Tyr Cys Glu Asp Ser Asp Asp Ser Asp Asp Phe Tyr 
1265 1270 1275 

Asp Ser Ser Glu Gly He Ala Asp Asp Gly Lys Asp Asp Thr Glu 
1280 1285 1290 

Gly Glu Glu Phe Lys Lys Ala Arg Glu Leu Arg Arg Gin Glu Val 
1295 1300 1305 



Arg Leu Gin Pro Leu Pro Ala Asn Leu Pro Thr Asp Thr Glu Thr 
1310 1315 1320 
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Glu Val Gin Thr Glu Ser Glu Ser Thr Ser Pro Asp Glu Val Glu 
1325 1330 1335 



Leu Asn Ser Ala Thr Glu lie Ser Thr Asp Ser Glu Phe Asp Asn 
1340 1345 1350 



Asp Glu lie He Arg Gin Ala Pro Lys He Phe He Asp Asp Thr 
1355 1360 1365 



His Leu Arg Lys Pro Thr Lys Val Gin He Lys Ser Thr Met He 
1370 1375 1380 



Gly Pro Asn Ala Ala Ser Ala Gly Leu His Gin Lys Gin Leu Ala 
1385 1390 1395 



Ala Arg Glu Lys Gly Gly Ser Tyr Leu Gin Lys Tyr Gin Pro Gin 
1400 1405 1410 



Pro Pro Leu Ser Gin Phe Lys Pro Leu Val Gin Val Asp Pro Thr 
1415 1420 1425 



Leu Leu He Gly Ser- Gin Arg Ala Pro Leu Gin Asn Pro Arg Pro 
1430 1435 1440 



Gly Asp Tyr Leu Leu Asn Lys Thr Ala Ser Thr Glu Gly He Ala 
1445 1450 1455 



Ser Lys Lys Ser Leu Glu Leu Lys Lys Arg Tyr Leu Leu Gly Glu 
1460 1465 1470 



Pro Ala Asn Gly Asp Lys He Gin Lys Ser Gly Ser Thr Ser Val 
1475 1480 1485 



Leu Asp Ser Arg He Arg Ser Phe Gin Ser Asn He Ser Glu Cys 
1490 1495 1500 



Gin Lys Leu Leu Asn Pro Ser Ser Asp He Ser Ala Gly Met Arg 
1505 1510 1515 



Thr Phe Leu Asp Arg Thr Lys Leu Gly Glu Gly Ser Gin Thr Thr 
1520 1525 1530 



Pro Gly Gin Thr Asn Glu Leu He Arg Ser Ala Thr Ser Asn Val 
1535 1540 1545 
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lie Asn Asp Leu Arg Val Glu Leu Arg lie Gin Lys Thr Asp Ser 
1550 1555 1560 



Ser His Ser Thr Asp Asn Glu Lys Glu Asn Val Phe Val Asn Cys 
1565 1570 1575 



Lys Asn Glu Leu Asn Lys Gly Met Glu Tyr Thr Asp Ala Val Asn 
1580 1585 1590 



Ala Thr Leu Leu Asp Gin Leu Ala Arg Lys Ser Ser Pro Thr Thr 
1595 1600 1605 



Pro Thr Asn Lys Thr Val Val Glu Val lie Asp Leu Val Thr Pro 
1610 1615 1620 



Glu Lys Pro lie Asp He He Asp Leu Thr Ala Leu Glu Thr Pro 
1625 1630 1635 



Lys Lys Gin Leu Val Asp Gly Ser Ala Met Asp Val Asp Glu Arg 
1640 1645 1650 



Leu Thr Pro Asp Ser Asn Lys He Ser Glu Leu Gin Gin Glu Val 
1655 1660 1665 



Lys Glu Glu Pro Lys Pro Asp Val Ser Arg Asp Val Lys Glu Cys 
1670 1675 1680 



He Pro Asp He Leu Gly His He Lys Glu Gly Thr Gly Ser Lys 
1685 1690 1695 



Glu Pro Gly Gly Glu Asp Gin Gin Ser Leu Leu Glu Gin Ser Asp 
1700 1705 1710 



Glu Glu Lys Arg Asp Ser Pro Glu Lys Asp Val Ala Glu His Glu 
1715 1720 1725 



Leu Tyr Glu Pro Asp Ser Val Gin He Gin Val Pro Asn He Pro 
173 0 1735 1740 



Trp Glu Lys Ser Lys Pro Glu Val Met Ser Thr Thr Gly Ser Ser 
1745 1750 1755 



Gly Ser He Cys Ser Ser Ser Asp Ser Ser Ser He Glu Asp He 
1760 1765 1770 



WO 03/066821 



121/161 



PCT/US03/03551 



Gin His Tyr lie Leu Glu Ser Thr Thr Ser Pro Asp Thr Gin Thr 
1775 1780 1785 



Val Gly Gly Lys His Asn Val Pro Arg Leu Glu Val His Asp Thr 
1790 1795 1800 



Ser Gly Ala Leu Met Gin Val Asp Ser Leu Met lie Val Asn Gly 
1805 1810 1815 



Lys Tyr He Gly Asp Pro Glu Asp Val Lys Phe Leu Asp Met Pro 
1820 1825 1830 



Ala Asn Val He Val Pro Pro Ala Pro Ala Leu Lys Thr Asn Glu 
1835 1840 1845 



Leu Asp Met Glu Asp Asp Gin Glu Ala Glu Ala Glu Pro Val Thr 
1850 1855 1860 



Ala Thr Pro Glu Pro Val Glu Cys Thr Val He Glu Ala Glu Arg 
1865 1870 1875 



Arg Val Thr Ala Pro Pro Pro Leu Pro Glu Met Gly Pro Pro Lys 
1880 1885 1890 



Leu Lys Phe Asp Ser Lys Asn Glu Asn Lys He Glu Ser Leu Lys 
1895 1900 1905 



Asn Leu Pro Leu He Val Glu Ser Asn Val Glu His Ser Gin Ala 
1910 1915 1920 



Val Lys Pro He Thr Leu Asn Leu Ser Asn Leu Ala Arg Thr Pro 
1925 1930 1935 



Asp Thr Pro Thr Thr Pro Thr Ala His Asp Ser Asp Lys Thr Pro 
1940 1945 1950 



Thr Gly Glu He Leu Ser Arg Gly Ser Asp Ser Glu Thr Glu His 
1955 1960 1965 



Thr Gly Thr Gly Gin Val Leu Thr Glu Thr Glu Leu Ser Asp Trp 
1970 1975 1980 



Thr Ala Asp Asp Cys He Ser Glu Asn Phe Val Asp Leu Glu Phe 
1985 1990 1995 



Ala Leu Asn Ser Asn Lys Gly Thr He Lys Arg Arg Lys Asp Arg 
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2000 2005 2010 



Arg Arg Ser Gly Ala Ser Lys Leu Pro Ser Gly Asn Glu Val lie 
2015 2020 2025 



His Glu Leu Ala Arg Gin Ala Pro Val Val Gin Met Asp Gly lie 
2030 2035 2040 



Leu Ser Ala lie Asp lie Asp Asp lie Glu Phe Met Asp Thr Gly 
2045 2050 2055 



Ser Glu Gly Ser Cys Ala Glu Ala Tyr Pro Ala Thr Asn Thr Ala 
2060 2065 2070 



Leu lie Gin Asn Arg Gly Tyr Met Glu Tyr lie Glu Ala Glu Pro 
2075 2080 2085 



Lys Lys Thr Thr Arg Lys Ala Ala Pro Pro Ser Ser Tyr Pro Gly 
2090 2095 2100 



Asn Leu Pro Pro Leu Met Thr Lys Arg Asp Glu Lys Leu Gly Val 
2105 2110 2115 



Asp Tyr lie Glu Gin Gly Ala Tyr lie Met His Asp Asp Ala Lys 
2120 2125 2130 



Thr Pro Val Asn Glu Val Ala Pro Ala Met Thr Gin Ser Leu Thr 
2135 2140 2145 



Asp Ser lie Thr Leu Asn Glu Leu Asp Asp Asp Ser Met lie lie 
2150 2155 2160 



Ser Gin Thr Gin Pro Thr Thr Thr Glu Glu Ser Glu Ala Leu Thr 
2165 2170 2175 



Val Val Thr Ser Pro Leu Asp Thr Ser Ser Pro Arg Val Leu Asp 
2180 2185 2190 



Gin Phe Ala Ser Met Leu Ala Ala Gly Lys Gly Asp Ser Thr Pro 
2195 2200 2205 



Ser Ser Ser Glu Gin Gin Pro Lys Thr Ser Thr Val Thr Ser Ser 
2210 2215 2220 



Ser Thr Gly Pro Asn Ser Ser Thr Thr Gly Asn Val Ser Lys Glu 
2225 2230 2235 
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Pro Gin Glu Glu Asp Leu Gin lie Gin Phe Glu Tyr Val Arg Ala 
2240 . 2245 2250 



Leu Gin Gin Arg lie Ser Gin lie Ser Thr Gin Arg Arg Lys Ser 
2255 2260 2265 



Ser Lys Gly Glu Ala Pro Asn Leu Gin Leu Asn Ser Ser Ala Pro 
2270 2275 2280 



Val lie Glu Ser Ala Glu Asp Pro Ala Lys Pro Ala Glu Glu Pro 
2285 2290 2295 



Leu Val Ser Met Arg Pro Arg Thr Thr Ser lie Ser Gly Lys Val 
2300 2305 2310 



Pro Glu lie Pro Thr Leu Ser Ser Lys Leu Glu Glu lie Thr Lys 
2315 2320 2325 



Glu Arg Thr Lys Gin Lys Asp Leu He His Asp Leu Val Met Asp 
2330 2335 2340 



Lys Leu Gin Ser Lys Lys Gin Leu Asn Ala Glu Lys Arg Leu His 
2345 2350 2355 



Arg Ser Arg Gin Arg Ser Leu Leu Thr Ser Gly Tyr Ala Ser Gly 
2360 2365 2370 



Ser Ser Leu Ser Pro Thr Pro Lys Leu Ala Ala Ala Cys Ser Pro 
2375 2380 2385 



Gin Asp Ser Asn Cys Ser Ser Gin Ala His Tyr His Ala Ser Thr 
2390 2395 " 2400 



Ala Glu Glu Ala Pro Lys Pro Pro Ala Glu Arg Pro Leu Gin Lys 
2405 2410 2415 



Ser Ala Thr Ser Thr Tyr Val Ser Pro Tyr Arg Thr Val Gin Ala 
2420 2425 2430 



Pro Thr Arg Ser Ala Asp Leu Tyr Lys Pro Arg Pro Phe Ser Glu 
2435 2440 2445 



His He Asp Ser Asn Ala Leu Ala Gly Tyr Lys Leu Gly Lys Thr 
2450 2455 2460 
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Ala Ser Phe Asn Gly Gly Lys Leu Gly Asp Phe Ala Lys Pro lie 
2465 2470 2475 



Ala Pro Ala Arg Val Asn Arg Gly Gly Gly Val Ala Thr Ala Asp 
2480 2485 2490 



lie Ala Asn lie Ser Ala Ser Thr Glu Asn Leu Arg Ser Glu Ala 
2495 2500 2505 



Arg Ala Arg Ala Arg Leu Lys Ser Asn Thr Glu Leu Gly Leu Ser 
2510 2515 2520 



Pro Glu Glu Lys Met Gin Leu lie Arg Ser Arg Leu His Tyr Asp 
2525 2530 2535 



Gin Asn Arg Ser Leu Lys Pro Lys Gin Leu Glu Glu Met Pro Ser 
2540 2545 2550 



Gly Asp Leu Ala Ala Arg Ala Arg Lys Met Ser Ala Ser Lys Ser 
2555 2560 2565 



Val Asn Asp Leu Ala Tyr Met Val Gly Gin Gin Gin Gin Gin Gin 
2570 2575 2580 



Val Glu Lys Asp Ala Val Leu Gin Ala Lys Ala Ala Asp Phe Thr 
2585 2590 2595 



Ser Asp Pro Asn Leu Ala Ser Gly Gly Gin Glu Lys Ala Gly Lys 
2600 2605 2610 



Thr Lys Ser Gly Arg Arg Pro Lys Asp Pro Glu Arg Arg Lys Ser 
2615 2620 2625 



Leu lie Gin Ser Leu Ser Ser Phe Phe Gin Lys Gly Ser Gly Ser 
2630 2635 2640 



Ala Ala Ser Ser Ser Lys Glu Gin Gly Gly Ala Val Ala Ala Val 
2645 2650 2655 



His Ser Glu Gin Ser Glu Arg Pro Gly Thr Ser Ser Ser Gly Thr 
2660 2665 2670 



Pro Thr lie Ser Asp Ala Ala Gly Gly Gly Gly Gly Gly Gly Gly 
2675 2680 2685 
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Val Phe Ser Arg Phe Arg He Ser Pro Lys Ser Lys Glu Lys Ser 
2690 2695 2700 

Lys Ser Cys Phe Asp Leu Arg Asn Phe Gly Phe Gly Asp Lys Asp 
2705 2710 2715 

Met Leu Val Cys Asn Ala Ala Ser Pro Ala Gly Ala Thr Ser Ala 
2720 2725 27 30 

Ser Gin Lys Asn His Ser Gin Glu Tyr Leu Asn Thr Thr Asn Asn 
2735 2740 2745 

Ser Arg Tyr Arg Lys Gin Thr Asn Thr Ala Lys Pro Lys Pro Glu 
2750 2755 2760 

Ser Phe Ser Ser Ser Ser Pro Gin Leu Tyr He His Lys Pro His 
2765 2770 2775 

His Leu Ala Ala Ala His Pro Ser Ala Leu Asp Asp Gin Thr Pro 
2780 2785 2790 

Pro Pro He Pro Pro Leu Pro Leu Asn Tyr Gin Arg Ser Asp Asp 
2795 2800 2805 

Glu Ser Tyr Ala Asn Glu Thr Arg Glu His Lys Lys Gin Arg Ala 
2810 2815 2820 

He Ser Lys Ala Ser Arg Gin Ala Glu Leu Lys Arg Leu Arg He 
2825 2830 2835 

Ala Gin Glu He Gin Arg Glu Gin Glu Glu He Glu Val Gin Leu 
2840 2845 2850 

Lys Asp Leu Glu Ala Arg Gly Val Leu He Glu Lys Ala Leu Arg 
2855 2860 2865 

Gly Glu Ala Gin Asn He Glu Asn Leu Asp Ala Thr Lys Asp Asn 
2870 2875 2880 

Asp Glu Lys Leu Leu Lys Glu Leu Leu Glu He Trp Arg Asn He 
2885 2890 2895 

Thr Ala Leu Lys Lys Arg Asp Glu Glu Leu Thr He Arg Gin Gin 
2900 2905 2910 



Glu Leu Gin Leu Glu Tyr Arg His Ala Gin Leu Lys Glu Glu Leu 
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2915 2920 2925 



Asn Leu Arg Leu Ser Cys Asn Lys Leu Asp Lys Ser Ser Ala Asp 
2930 2935 2940 



Val Ala Ala Glu Gly Ala lie Leu Asn Glu Met Leu Glu lie Val 
2945 2950 2955 



Ala Lys Arg Ala Ala Leu Arg Pro Thr Ala Ser Gin Leu Asp Leu 
2960 2965 2970 



Thr Ala Ala Gly Ser Ala Ser Thr Ser Ala Glu Ala Thr Gly lie 
2975 2980 2985 



Lys Leu Thr Gly Gin Pro His Asp His Glu Glu Ser lie He 
2990 2995 3000 



<210> 21 

<211> 1048 

<212> PRT 

<213> Mouse 

<400> 21 

Met Ala Ser Pro Ala Ser Thr Asn Pro Ala His Asp His Phe Glu Thr 
15 10 15 



Phe Val Gin Ala Gin Leu Cys Gin Asp Val Leu Ser Ser Phe Gin Gly 
20 25 30 



Leu Cys Arg Ala Leu Gly Val Glu Ser Gly Gly Gly Leu Ser Gin Tyr 
35 40 45 



His Lys He Lys Ala Gin Leu Asn Tyr Trp Ser Ala Lys Ser Leu Trp 
50 55 60 



Ala Lys Leu Asp Lys Arg Ala Ser Gin Pro Val Tyr Gin Gin Gly Gin 
65 70 75 80 



Ala Cys Thr Asn Thr Lys Cys Leu Val Val Gly Ala Gly Pro Cys Gly 
85 90 95 



Leu Arg Ala Ala Val Glu Leu Ala Leu Leu Gly Ala Arg Val Val Leu 
100 105 110 



Val Glu Lys Arg He Lys Phe Ser Arg His Asn Val Leu His Leu Trp 
115 120 125 
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Pro Phe Thr lie His Asp Leu Arg Ala Leu Gly Ala Lys Lys Phe Tyr 
130 135 140 



Gly Arg Phe Cys Thr Gly Thr Leu Asp His lie Ser lie Arg Gin Leu 
145 150 155 160 



Gin Leu Leu Leu Leu Lys Val Ala Leu Leu Leu Gly Val Glu lie His 
165 170 175 



Trp Gly Val Lys Phe Thr Gly Leu Gin Pro Pro Pro Arg Lys Gly Ser 
180 185 190 



Gly Trp Arg Ala Gin Leu Gin Pro Asn Pro Pro Ala Gin Leu Ala Ser 
195 200 205 



Tyr Glu Phe Asp Val Leu lie Ser Ala Ala Gly Gly Lys Phe Val Pro 
210 215 220 



Glu Gly Phe Thr lie Arg Glu Met Arg Gly Lys Leu Ala lie Gly lie 
225 230 235 240 



Thr Ala Asn Phe Val Asn Gly Arg Thr Val Glu Glu Thr Gin Val Pro 
245 250 255 



Glu lie Ser Gly Val Ala Arg lie Tyr Asn Gin Lys Phe Phe Gin Ser 
260 265 270 



Leu Leu Lys Ala Thr Gly He Asp Leu Glu Asn He Val Tyr Tyr Lys 
275 280 285 



Asp Glu Thr His Tyr Phe Val Met Thr Ala Lys Lys Gin Cys Leu Leu 
290 295 300 



Arg Leu Gly Val Leu Arg Gin Asp Leu Ser Glu Thr Asp Gin Leu Leu 
305 310 315 320 



Gly Lys Ala Asn Val Val Pro Glu Ala Leu Gin Arg Phe Ala Arg Ala 
325 330 335 



Ala Ala Asp Phe Ala Thr His Gly Lys Leu Gly Lys Leu Glu Phe Ala 
340 345 350 



Gin Asp Ala Arg Gly Arg Pro Asp Val Ala Ala Phe Asp Phe Thr Ser 
355 360 365 
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Met Met Arg Ala Glu Ser Ser Ala Arg Val Gin Glu Lys His Gly Ala 
370 375 380 



Arg Leu Leu Leu Gly Leu Val Gly Asp Cys Leu Val Glu Pro Phe Trp 
385 390 395 400 



Pro Leu Gly Thr Gly Val Ala Arg Gly Phe Leu Ala Ala Phe Asp Ala 
405 410 415 



Ala Trp Met Val Lys Arg Trp Ala Glu Gly Ala Gly Pro Leu Glu Val 
420 425 430 



Leu Ala Glu Arg Glu Ser Leu Tyr Gin Leu Leu Ser Gin Thr Ser Pro 
435 440 445 



Glu Asn Met His Arg Asn Val Ala Gin Tyr Gly Leu Asp Pro Ala Thr 
450 455 460 



Arg Tyr Pro Asn Leu Asn Leu Arg Ala Val Thr Pro Asn Gin Val Gin 
465 470 475 480 



Asp Leu Tyr Asp Met Met Asp Lys Glu His Ala Gin Arg Lys Ser Asp 
485 490 495 



Glu Pro Asp Ser Arg Lys Thr Thr Thr Gly Ser Ala Gly Thr Glu Glu 
500 505 510 



Leu Leu His Trp Cys Gin Glu Gin Thr Ala Gly Phe Pro Gly Val His 
515 520 525 



Val Thr Asp Phe Ser Ser Ser Trp Ala Asp Gly Leu Ala Leu Cys Ala 
530 535 540 



Leu Val His His Leu Gin Pro Gly Leu Leu Glu Pro Ser Glu Leu Gin 
545 550 555 560 



Gly Met Gly Ala Leu Glu Ala Thr Thr Trp Ala Leu Arg Val Ala Glu 
565 570 575 



His Glu Leu Gly lie Thr Pro Val Leu Ser Ala Gin Ala Val Met Ala 
580 585 590 



Gly Ser Asp Pro Leu Gly Leu lie Ala Tyr Leu Ser His Phe His Ser 
595 600 605 



Ala Phe Lys Asn Thr Ser His Ser Ser Gly Leu Val Ser Gin Pro Ser 
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610 615 620 

Gly Thr Pro Ser Ala lie Leu Phe Leu Gly Lys Leu Gin Arg Ser Leu 
625 630 635 640 

Gin Arg Thr Arg Ala Lys Val Asp Glu Glu Thr Pro Ser Thr Glu Glu 
645 650 655 

Pro Pro Val Ser Glu Pro Ser Met Ser Pro Asn Thr Pro Glu Leu Ser 
660 665 670 

Glu His Gin Glu Ala Gly Ala Glu Glu Leu Cys Glu Leu Cys Gly Lys 
675 68 0 685 

His Leu Tyr lie Leu Glu Arg Phe Cys Val Asp Gly His Phe Phe His 
690 695 700 

Arg Ser Cys Phe Cys Cys His Thr Cys Glu Ala Thr Leu Trp Pro Gly 
705 710 715 720 

Gly Tyr Gly Gin His Pro Gly Asp Gly His Phe Tyr Cys Leu Gin His 
725 730 ' 735 

Leu Pro Gin Glu Asp Gin Lys Glu Ala Asp Asn Asn Gly Ser Leu Glu 
740 745 " 750 

Ser Gin Glu Leu Pro Thr Pro Gly Asp Ser Asn Met Gin Pro Asp Pro 
755 760 765 

Ser Ser Pro Pro Val Thr Arg Val Ser Pro Val Pro Ser Pro Ser Gin 
770 775 780 

Pro Ala Arg Arg Leu lie Arg Leu Ser Ser Leu Glu Arg Leu Arg Leu 
785 790 795 ~ 8 00 

Ser Ser Leu Asn lie lie Pro Asp Ser Gly Ala Glu Pro Pro Pro Lys 
805 810 815 

Pro Pro Arg Ser Cys Ser Asp Leu Ala Arg Glu Ser Leu Lys Ser Ser 
820 825 830 

Phe Val Gly Trp Gly Val Pro Val Gin Ala Pro Gin Val Pro Glu Ala 
835 840 845 



lie Glu Lys Gly Asp Asp Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu 
850 855 860 
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Glu Glu Glu Pro Leu Pro Pro Leu Glu Pro Glu Leu Glu Gin Thr Leu 
865 870 875 880 



Leu Thr Leu Ala Lys Asn Pro Gly Ala Met Thr Lys Tyr Pro Thr Trp 
885 890 895 



Arg Arg Thr Leu Met Arg Arg Ala Lys Glu Glu Glu Met Lys Arg Phe 
900 905 910 



Cys Lys Ala Gin Ala lie Gin Arg Arg Leu Asn Glu lie Glu Ala Thr 
915 920 925 



Met Arg Glu Leu Glu Ala Glu Gly Thr Lys Leu Glu Leu Ala Leu Arg 
930 935 940 



Lys Glu Ser Ser Ser Pro Glu Gin Gin Lys Lys Leu Trp Leu Asp Gin 
945 950 955 960 



Leu Leu Arg Leu He Gin Lys Lys Asn Ser Leu Val Thr Glu Glu Ala 
965 970 975 



Glu Leu Met He Thr Val Gin Glu Leu Asp Leu Glu Glu Lys Gin Arg 
980 985 990 



Gin Leu Asp His Glu Leu Arg Gly Tyr Met Asn Arg Glu Glu Thr Met 
995 1000 1005 



Lys Thr Glu Ala Asp Leu Gin Ser Glu Asn Gin Val Leu Arg Lys 
1010 1015 1020 



Leu Leu Glu Val Val Asn Gin Arg Asp Ala Leu He Gin Phe Gin 
1025 1030 1035 



Glu Glu Arg Arg Leu Arg Glu Met Pro Ala 
1040 1045 



<210> 22 

<211> 1480 

<212> PRT 

<213> Mouse 

<400> 22 

Met Gly Glu Asn Glu Asp Glu Lys Gin Ala Gin Ala Ser Gin Val Phe 
15 10 15 
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Glu Asn Phe Val Gin Ala Thr Thr Cys Lys Gly Thr Leu Gin Ala Phe 
20 25 30 



Asn lie Leu Thr Cys Leu Leu Asp Leu Asp Pro Leu Asp His Arg Asn 
35 40 45 



Phe Tyr Ser Gin Leu Lys Ser Lys Val Asn Thr Trp Lys Ala Lys Ala 
50 55 60 



Leu Trp His Lys Leu Asp Lys Arg Gly Ser His Lys Glu Tyr Lys Arg 
65 70 75 80 



Gly Lys Ala Cys Ser Asn Thr Lys Cys Leu lie Val Gly Gly Gly Pro 
85 90 95 



Cys Gly Leu Arg Thr Ala lie Glu Leu Ala Tyr Leu Gly Ala Lys Val 
100 105 110 



Val Val Val Glu Lys Arg Asp Thr Phe Ser Arg Asn Asn Val Leu His 
115 120 125 



Leu Trp Pro Phe Thr lie His Asp Leu Arg Gly Leu Gly Ala Lys Lys 
130 135 140 



Phe Tyr Gly Lys Phe Cys Ala Gly Ser lie Asp His lie Ser He Arg 
145 150 155 160 



Gin Leu Gin Leu He Leu Phe Lys Val Ala Leu Met Leu Gly Val Glu 
165 170 175 



Val His Val Asn Val Glu Phe Val Arg Val Leu Glu Pro Pro Glu Asp 
180 185 190 



Gin Glu Asn Gin Lys Val Gly Trp Arg Ala Glu Phe Leu Pro Ala Asp 
195 200 205 



His Ala Leu Ser Asp Phe Glu Phe Asp Val He He Gly Ala Asp Gly 
210 215 220 



His Arg Asn Thr Leu Glu Gly Phe Arg Arg Lys Glu Phe Arg Gly Lys 
225 230 235 240 



Leu Ala He Ala He Thr Ala Asn Phe He Asn Arg Asn Ser Thr Ala 
245 250 255 



Glu Ala Lys Val Glu Glu He Ser Gly Val Ala Phe He Phe Asn Gin 
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260 265 270 



Lys Phe Phe Gin Asp Leu Lys Glu Glu Thr Gly lie Asp Leu Glu Asn 
275 280 285 



He Val Tyr Tyr Lys Asp Ser Thr His Tyr Phe Val Met Thr Ala Lys 
290 295 300 



Lys Gin Ser Leu Leu Asp Lys Gly Val He Leu Asn Asp Tyr He Asp 
305 310 315 320 



Thr Glu Met Leu Leu Cys Ser Glu Asn Val Asn Gin Asp Asn Leu Leu 
325 330 335 



Ser Tyr Ala Arg Glu Ala Ala Asp Phe Ala Thr Asn Tyr Gin Leu Pro 
340 345 350 



Ser Leu Asp Phe Ala He Asn His Asn Gly Gin Pro Asp Val Ala Met 
355 360 365 



Phe Asp Phe Thr Ser Met Tyr Ala Ser Glu Asn Ala Ala Leu Met Arg 
370 375 380 



Glu Arg Gin Ala His Gin Leu Leu Val Ala Leu Val Gly Asp Ser Leu 
385 390 395 400 



Leu Glu Pro Phe Trp Pro Met Gly Thr Gly Cys Ala Arg Gly Phe Leu 
405 410 415 



Ala Ala Phe Asp Thr Ala Trp Met Val Lys Ser Trp Asp Gin Gly Thr 
420 425 430 



Pro Pro Leu Glu Val Leu Ala Glu Arg Glu Ser Leu Tyr Arg Leu Leu 
435 440 445 



Pro Gin Thr Thr Pro Glu Asn He Asn Lys Asn Phe Glu Gin Tyr Thr 
450 455 460 



Leu Asp Pro Ala Thr Arg Tyr Pro Asn Leu Asn Leu His Cys Val Arg 
465 470 475 480 



Pro His Gin Val Lys His Leu Tyr He Thr Lys Glu Met Asp Arg Phe 
485 490 495 



Pro Leu Glu Arg Trp Gly Ser Val Arg Arg Ser Val Ser Leu Ser Arg 
500 505 510 



WO 03/066821 



133/161 



PCT/US03/03551 



Arg Glu Ser Asp lie Arg Pro Asn Lys Leu Leu Thr Trp Cys Gin Gin 
515 520 525 



Gin Thr Lys Gly Tyr Gin His Val Arg Val Thr Asp Leu Thr Thr Ser 
530 535 540 



Trp Arg Ser Gly Leu Ala Leu Cys Ala lie He His Ser Phe Arg Pro 
545 550 555 560 



Glu Leu He Asn Phe Asp Ser Leu Asn Glu Asp Asp Ala Val Glu Asn 
565 570 575 



Asn. Gin Leu Ala Phe Asp Val Ala Lys Arg Glu Phe Gly He Leu Pro 
580 585 590 



Val Thr Thr Gly Lys Glu Met Ala Ser Thr Gin Glu Pro Asp Lys Leu 
595 600 605 



Ser Met Val Met Tyr Leu Ser Lys Phe Tyr Glu Leu Phe Arg Gly Thr 
610 615 620 



Pro Leu Arg Pro Met Asp Ser Trp Arg Lys Asn Tyr Gly Glu Asn Ala 
625 630 635 640 



Asp Phe Gly Leu Gly Lys Thr Phe He Gin Asn Asn Tyr Leu Asn Leu 
645 650 655 



Thr Leu Pro Arg Lys Arg Thr Pro Arg Val Asp Thr Gin Thr Glu Glu 
660 665 670 



Asn Asp Met Asn Lys Arg Arg Arg Gin Gly Phe Asn His Leu Glu Glu 
675 680 685 



Leu Pro Ser Phe Ser Ser Arg Ser Leu Gly Ser Ser Gin Glu Tyr Ala 
690 695 700 



Lys Glu Ser Gly Ser Gin Asn Lys Val Lys His Met Ala Asn Gin Leu 
705 710 715 720 



Leu Ala Lys Phe Glu Glu Asn Thr Arg Asn Pro Ser Val Val Lys Gin 
725 730 735 



Glu Ser Pro Arg Lys Ala Phe Pro Leu Ser Leu Gly Gly Arg Asp Thr 
740 745 750 
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Cys Tyr Phe Cys Lys Lys Arg Val Tyr Met lie Glu Arg Leu Ser Ala 
755 760 765 



Glu Gly His Phe Phe His Gin Glu Cys Phe Arg Cys Ser Val Cys Ser 
770 775 780 



Ala Thr Leu Arg Leu Ala Ala Tyr Ala Phe Asp Cys Asp Glu Gly Lys 
785 790 795 800 



Phe Tyr Cys Lys Pro His Phe Val His Cys Lys Thr Ser Ser Lys Gin 
805 810 815 



Arg Lys Arg Arg Ala Glu Leu Asn Gin Gin Arg Glu Glu Glu Gly Thr 
820 825 830 



Trp Gin Glu Gin Glu Ala Pro Arg Arg Asp Val Pro Thr Glu Ser Ser 
835 840 845 



Cys Ala Val Ala Ala He Ser Thr Pro Glu Gly Ser Pro Pro Gly Thr 
850 855 860 



Ser Thr Ser Phe Phe Arg Lys Ala Leu Ser Trp Pro Leu Arg Leu Thr 
865 870 875 880 



Arg Gly Leu Leu Asn Leu Pro Gin Ser Leu Leu Arg Trp Met Gin Gly 
885 890 895 



Leu Gin Glu Ala Ala Gly His His Val Arg Asp Asn Ala His Asn Tyr 
900 905 910 



Cys Phe Met Phe Glu Leu Leu Ser Leu Gly Leu Leu Leu Leu Trp Ala 
915 920 925 



Phe Ser Lys Val Leu Ala Ala Met Tyr Arg Glu Ser Glu Glu Ser Leu 
930 935 940 



Glu Asn He Arg Ser Trp Leu Leu Arg Phe He Pro Val Lys Leu Gin 
945 950 955 960 



Met Gly Gin Pro Gly Gly Pro Glu Leu Ser Lys Glu Arg Lys Leu Gly 
965 970 975 



Leu Lys Lys Leu Val Leu Thr Glu Glu Gin Lys Asn Lys Leu Leu Asp 
980 985 990 
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Trp Ser Asp Cys Thr Gin Glu His Lys Thr Gly Glu Gin Leu Ser Gin 
995 1000 1005 



Glu Ser Ala Glu Asn He Arg Gly Gly Ser Leu Lys Pro Thr Cys 
1010 1015 1020 



Ser Ser Thr Leu Ser Gin Ala Val Lys Glu Lys Leu Leu Ser Gin 
1025 1030 1035 



Lys Lys Ala Leu Gly Gly Met Arg Thr Pro Ala Val Lys Ala Pro 
1040 1045 1050 



Gin Glu Arg Glu Val Pro Pro Pro Lys Ser Pro Leu Lys Leu He 
1055 1060 1065 



Ala Asn Ala He Leu Arg Ser Leu Leu His Asn Ser Glu Ala Gly 
1070 1075 1080 



Lys Lys Thr Ser Pro Lys Pro Glu Ser Lys Thr Leu Pro Arg Gly 
1085 1090 1095 



Gin Pro His Ala Arg Ser Phe Ser Leu Arg Lys Leu Gly Ser Ser 
1100 1105 1110 



Lys Asp Gly Asp Gin Gin Ser Pro Gly Arg His Met Ala Lys Lys 
1115 1120 1125 



Ala Ser Ala Phe Phe Ser Leu Ala Ser Pro Thr Ser Lys Val Ala 
1130 1135 1140 



Gin Ala Ser Asp Leu Ser Leu Pro Asn Ser He Leu Arg Ser Arg 
1145 1150 1155 



Ser Leu Pro Ser Arg Pro Ser Lys Met Phe Phe Ser Thr Thr Pro 
1160 1165 1170 



His Ser Lys Val Glu Asp Val Pro Thr Leu Leu Glu Lys Val Ser 
1175 1180 1185 



Leu Gin Asp Ala Thr His Ser Pro Lys Thr Gly Ala Ser His He 
1190 1195 1200 



Ser Ser Leu Gly Leu Lys Asp Lys Ser Phe Glu Ser Phe Leu Gin 
1205 1210 1215 



Glu Cys Lys Gin Arg Lys Asp He Gly Asp Phe Phe Asn Ser Pro 
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1220 1225 1230 



Lys Glu Glu Gly Pro Pro Gly Asn Arg Val Pro Ser Leu Glu Lys 
1235 1240 1245 



Leu Val Gin Pro Val Gly Ser Thr Ser Met Gly Gin Val Ala His 
1250 1255 1260 



Pro Ser Ser Thr Gly Gin Asp Ala His Pro Val Ala Pro Val Thr 
1265 1270 1275 



Glu Ala Thr Ser Ser Pro Thr Ser Ser Ser Ala Glu Glu Glu Ala 
1280 1285 1290 



Asp Ser Gin Leu Ser Leu Arg lie Lys Glu Lys lie Leu Arg Arg 
1295 1300 1305 



Arg Arg Lys Leu Glu Lys Gin Ser Ala Lys Gin Glu Glu Leu Lys 
1310 1315 1320 



Arg Leu His Lys Ala Gin Ala lie Gin Arg Gin Leu Glu Glu Val 
1325 1330 1335 



Glu Glu Arg Gin Arg Thr Leu Ala lie Gin Gly Val Lys Leu Glu 
1340 1345 1350 



Lys Val Leu Arg Gly Glu Ala Ala Asp Ser Gly Thr Gin Asp Glu 
1355 1360 1365 



Ala Gin Leu Leu Gin Glu Trp Phe Lys Leu Val Leu Glu Lys Asn 
1370 1375 1380 



Lys Leu Met Arg Tyr Glu Ser Glu Leu Leu lie Met Ala Gin Glu 
1385 1390 1395 



Leu Glu Leu Glu Asp His Gin Ser Arg Leu Glu Gin Lys Leu Arg 
1400 1405 1410 



Gin Lys Met Leu Lys Asp Glu Gly Gin Lys Asp Glu Asn Asp Leu 
1415 1420 1425 



Lys Glu Glu Gin Glu lie Phe Glu Glu Met Met Gin Val lie Glu 
1430 1435 1440 



Gin Arg Asn Lys Leu Val Asp Ser Leu Glu Glu Gin Arg Val Lys 
1445 1450 1455 
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Glu Arg Thr Gin Asp Gin His Phe Glu Asn Phe Val Leu Ser Arg 
1460 1465 1470 



Gly Cys Gin Leu Ser Arg Thr 
1475 1480 



<210> 


23 


<211> 


1026 


<212> 


PRT 


<213> 


Mouse 


<220> 




<221> 


MI S COFEATURE 


<222> 


(1016) . . (1016) 


<223> 


Xaa is any amino 


<400> 


23 



Met Glu Glu Arg Lys Gin Glu Thr Thr Asn Gin Ala His Val Leu Phe 
15 10 15 



Asp Arg Phe Val Gin Ala Thr Thr Cys Lys Gly Thr Leu Arg Ala Phe 
20 25 3 0 



Gin Glu Leu Cys Asp His Leu Glu Leu Lys Pro Lys Asp Tyr Arg Ser 
35 40 45 



Phe Tyr His Lys Leu Lys Ser Lys Leu Asn Tyr Trp Lys Ala Lys Ala 
50 55 60 



Leu Trp Ala Lys Leu Asp Lys Arg Gly Ser His Lys Asp Tyr Lys Lys 
65 70 75 80 



Gly Lys Ala Cys Thr Asn Thr Lys Cys Leu lie lie Gly Ala Gly Pro 
85 90 95 



Cys Gly Leu Arg Thr Ala lie Asp Leu Ser Leu Leu Gly Ala Lys Val 
100 105 110 



Val Val He Glu Lys Arg Asp Ala Phe Ser Arg Asn Asn Val Leu His 
115 120 125 



Leu Trp Pro Phe Thr He His Asp Leu Arg Gly Leu Gly Ala Lys Lys 
130 135 140 



Phe Tyr Gly Lys Phe Cys Ala Gly Ala He Asp His He Ser He Arg 
145 150 155 160 
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Gin Leu Gin Leu lie Leu Leu Lys Val Ala Leu lie Leu Gly lie Glu 
165 170 ~ 175 



lie His Val Asn Val Glu Phe Gin Gly Leu Val Gin Pro Pro Glu Asp 
180 185 190 



Gin Glu Asn Glu Arg lie Gly Trp Arg Ala Leu Val His Pro Lys Thr 
195 200 ' 205 



His Pro Val Ser Glu Tyr Glu Phe Glu Val lie He Gly Gly Asp Gly 
210 215 220 



Arg Arg Asn Thr Leu Glu Gly Phe Arg Arg Lys Glu Phe Arg Gly Lys 
225 230 235 ~ ~ 240 



Leu Ala He Ala He Thr Ala Asn Phe He Asn Arg Asn Thr Thr Ala 
245 250 255 



Glu Ala Lys Val Glu Glu He Ser Gly Val Ala Phe He Phe Asn Gin 
260 265 270 



Lys Phe Phe Gin Glu Leu Arg Glu Thr Thr Gly He Asp Leu Glu Asn 
275 280 285 



He Val Tyr Tyr Lys Asp Asp Thr His Tyr Phe Val Met Thr Ala Lys 
290 295 300 



Lys Gin Ser Leu Leu Asp Lys Gly Val He Leu His Asp Tyr Thr Asp 
305 310 315 320 



Thr Glu Leu Leu Leu Ser Arg Glu Asn Val Asp Gin Glu Ala Leu Leu 
325 330 335 



Asn Tyr Ala Arg Glu Ala Ala Asp Phe Ser Thr Gin Gin Gin Leu Pro 
340 345 350 



Ser Leu Asp Phe Ala He Asn His Tyr Gly Gin Pro Asp Val Ala Met 
355 360 365 



Phe Asp Phe Thr Cys Met Tyr Ala Ser Glu Asn Ala Ala Leu Val Arg 
370 375 380 



Glu Gin Asn Gly His Gin Leu Leu Val Ala Leu Val Gly Asp Ser Leu 
385 390 395 400 
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Leu Glu Pro Phe Trp Pro Met Gly Thr Gly lie Ala Arg Gly Phe Leu 
405 410 415 



Ala Ala Met Asp Ser Ala Trp Met Val Arg Ser Trp Ser Leu Gly Thr 
420 425 430 



Ser Pro Leu Glu Val Leu Ala Glu Arg Glu Ser lie Tyr Arg Leu Leu 
435 440 445 



Pro Gin Thr Thr Pro Glu Asn Val Ser Lys Asn Phe Ser Gin Tyr Ser 
450 455 460 



lie Asp Pro Val Thr Arg Tyr Pro Asn lie Asn lie Asn Phe Leu Arg 
465 470 475 480 



Pro Ser Gin Val Arg His Leu Tyr Asp Ser Gly Glu Thr Lys Asp lie 
485 490 495 



His Leu Glu Met Glu Asn Met Val Asn Pro Arg Thr Thr Pro Lys Leu 
500 505 510 



Thr Arg Asn Glu Ser Val Ala Arg Ser Ser Lys Leu Leu Gly Trp Cys 
515 520 525 



Gin Arg Gin Thr Glu Gly Tyr Ser Gly Val Asn Val Thr Asp Leu Thr 
530 535 540 



Met Ser Trp Lys Ser Gly Leu Ala Leu Cys Ala He He His Arg Tyr 
545 550 555 560 



Arg Pro Asp Leu He Asp Phe Asp Ser Leu Asp Glu Gin Asn Val Glu 
565 570 575 



Lys Asn Asn Gin Leu Ala Phe Asp He Ala Glu Lys Glu Leu Gly He 
580 585 590 



Ser Pro He Met Thr Gly Lys Glu Met Ala Ser Val Gly Glu Pro Asp 
595 600 605 



Lys Leu Ser Met Val Met Tyr Leu Thr Gin Phe Tyr Glu Met Phe Lys 
610 615 620 



Asp Ser Leu Ser Ser Ser Asp Thr Leu Asp Leu Asn Ala Glu Glu Lys 
625 630 635 640 
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Ala Val Leu He Ala Ser Thr Lys Ser Pro He Ser Phe Leu Ser Lys 
645 650 655 

Leu Gly Gin Thr He Ser Arg Lys Arg Ser Pro Lys Asp Lys Lys Glu 
660 665 670 

Lys Asp Ser Asp Gly Ala Gly Lys Arg Arg Lys Thr Ser Gin Ser Glu 
675 680 685 

Glu Glu Glu Pro Pro Arg Ser Tyr Lys Gly Glu Arg Pro Thr Leu Val 
690 695 700 

Ser Thr Leu Thr Asp Arg Arg Met Asp Ala Ala Val Gly Asn Gin Asn 
705 710 715 720 

Lys Val Lys Tyr Met Ala Thr Gin Leu Leu Ala Lys Phe Glu Glu Asn 
725 730 735 

Ala Pro Ala Gin Ser Thr Gly Val Arg Arg Gin Gly Ser He Lys Lys 
740 745 750 

Glu Phe Pro Gin Asn Leu Gly Gly Ser Asp Thr Cys Tyr Phe Cys Gin 
755 760 765 

Lys Arg Val Tyr Val Met Glu Arg Leu Ser Ala Glu Gly Lys Phe Phe 
770 775 780 

His Arg Ser Cys Phe Lys Cys Glu Tyr Cys Ala Thr Thr Leu Arg Leu 
785 790 795 800 

Ser Ala Tyr Ala Tyr Asp He Glu Asp Glu Phe Ser Pro Asn Phe Trp 
805 810 815 

Cys Ser Ala His Tyr His Val Pro Val Ala Leu Pro Ala Thr Val Met 
820 825 830 

Pro Met Cys Leu Leu Tyr His Pro Ser Gin Val Leu Val Cys Leu Glu 
835 840 845 

Gly Gly Pro Ala Phe Met Ser Pro Val Leu Phe Asn Asp Thr Asn Ser 
850 855 860 

Arg Gin Ala Lys Gin Glu Glu Leu Lys Arg Leu His Arg Ala Gin He 
865 870 875 880 



He Gin Arg Gin Leu Glu Gin Val Glu Glu Lys Gin Arg Gin Leu Glu 
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885 890 895 



Glu Arg Gly Val Ala Val Glu Lys Ala Leu Arg Gly Glu Ala Gly Met 
900 905 910 



Gly Lys Lys Asp Asp Pro Lys Leu Met Gin Glu Trp Phe Lys Leu Val 
915 920 925 



Gin Glu Lys Asn Ala Met Val Arg Tyr Glu Ser Glu Leu Met lie Phe 
930 935 940 



Ala Arg Glu Leu Glu Leu Glu Asp Arg Gin Ser Arg Leu Gin Gin Glu 
945 950 955 960 



Leu Arg Glu Arg Met Ala Val Glu Asp His Leu Lys Thr Glu Gly Glu 
965 970 975 



Leu Ser Glu Glu Lys Lys lie Leu Asn Glu Met Leu Glu Val Val Glu 
980 985 990 



Gin Arg Asp Ser Leu Val Ala Leu Leu Glu Glu Gin Arg Leu Arg Glu 
995 1000 1005 



Lys Glu Glu Asp Lys Asp Leu Xaa Ala Ala Met Leu Cys Lys Gly 
1010 1015 1020 



Phe Ser Leu 
1025 



<210> 24 
<211> 476 
<212> PRT 

<213> Anopheles gambiae 
<400> 24 

Glu Met Phe Leu His Phe Cys Ala Ala Thr Thr Met Lys Gin lie Arg 
15 10 15 



Gly Leu Tyr Trp Asn Met Leu Asp Thr lie Gly Leu Arg Pro Gly Pro 
2 0 25 3 0 



Leu Glu Glu Phe Tyr Pro Lys Met Lys Ala Ala lie Arg Asp Trp Arg 
35 40 45 



Ala Gin Ala Leu Phe Lys Lys Phe Asp Ala Arg Ala Ala His Lys Val 
50 55 60 
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Tyr Cys Lys Gly Arg Ala Ala Ser Lys Thr Arg Val Leu lie Val Gly 
65 70 75 80 



Ala Gly Pro Cys Gly Leu Arg Thr Ala lie Asp Ala Gin Leu Leu Gly 
85 90 95 



Ala Lys Val Val Val Val Val Glu Lys Arg Asp Arg lie Ser Arg Asn 
100 105 110 



Asn Val Leu His Leu Trp Pro Phe lie lie His Asp Leu Lys Ala Leu 
115 120 125 



Gly Ala Lys Lys Phe Tyr Gly Lys Phe Cys Ala Gly Ser lie Asp His 
13 0 13 5 14 0 



lie Ser lie Arg Gin Leu Gin Cys lie Leu Leu Lys Val Ala Leu Leu 
145 150 155 160 



Leu Gly Val Glu Met His Glu Gly Val Ser Phe Val Lys Glu lie Glu 
165 170 175 



Pro Gly Asp Gly Tyr Gly Trp Arg Ala Ser Val Ser Pro Glu Asp His 
180 185 190 



Ala Val Ser His Tyr Glu Phe Asp Val Leu lie Gly Ala Asp Gly Lys 
195 200 205 



Arg Asn Thr Leu Glu Gly Phe Gin Arg Lys Glu Phe Arg Gly Lys Leu 
210 215 220 



Ala lie Ala lie Thr Ala Asn Phe He Asn Lys Arg Thr Glu Ala Glu 
225 230 235 240 



Ala Met Val Glu Glu He Ser Gly Val Ala Phe He Phe Asp Gin Pro 
245 250 255 



Phe Phe Lys Ala Leu Tyr Glu Lys Thr Gly Cys Asp Leu Glu Asn He 
260 265 270 



Val Tyr Tyr Lys Asp Asp Thr His Tyr Phe Val Met Thr Ala Lys Lys 
275 280 285 



His Ser Leu Leu His Arg Gly Val He He Lys Asp Leu Ser Asp Pro 
290 295 300 
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Ala Glu Leu Leu Ala Pro Ser Asn Val Asp Lys Pro Lys Leu Tyr Glu 
305 310 315 320 



Tyr Ala Arg Asp Ala Ala Asn Phe Ala Thr Lys Tyr Gin Met Pro Asn 
325 330 335 



Leu Glu Phe Ala Val Asn His Tyr Gly Thr Pro Asp Val Ala Val Phe 
340 345 350 



Asp Phe Thr Ser lie Phe Ala Ala His Asn Ser Cys Lys Val Thr Val 
355 360 365 



Arg Lys Asn Tyr Arg Leu Leu Ser Cys Leu Val Gly Asp Ser Leu Leu 
370 375 380 



Glu Pro Phe Trp Pro Thr Gly Ser Gly Cys Ala Arg Gly Phe Leu Ser 
385 390 395 400 



Ser Met Asp Ala Ala Tyr Ala lie Lys Leu Phe Ala Asn Pro Lys Asn 
405 410 415 



Ser Leu Leu Ala Thr lie Ala Gin Arg Glu Ser Val Tyr Arg Leu Leu 
420 425 430 



Gly Gin Thr Thr Pro Glu Asn Leu Asn Arg Ala Phe Gly Ala Tyr Thr 
435 440 445 



Leu Asp Pro Ser Thr Arg Tyr Lys Asn Leu Asn Lys Ala Ser Val Gin 
450 455 460 



lie Gly Gin Val Lys His Leu Leu Asp Thr Asp Asp 

475 



465 




470 


<210> 


25 




<211> 


211 




<212> 


PRT 




<213> 


Ciona 


intestinalis 


<400> 


25 





Asn lie Val Tyr Tyr Lys Gly Glu Thr His Tyr Phe Val Met Thr Ala 
1 5 10 15 



Lys Lys His Ser Leu Val Ser Lys Gly Val Leu Lys Gin Asp Tyr Asp 
20 25 30 



Asn Thr Asn Glu Leu Leu Cys Tyr Asn Asn lie Asp Gin Glu Glu Leu 
35 40 45 
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Met Lys Tyr Ala Lys Gin Ala Ala Asp Phe Ser Thr Arg His Gin Leu 
50 55 60 



Pro His Leu Asp Phe Ala lie Asn Gin Tyr Gly Gin Ser Asp lie Ala 
65 70 75 80 



Leu Phe Asp Phe Thr Cys Met Tyr Ala Ala Glu Asn Ala Ala Leu Phe 
85 90 95 



Arg Glu Thr Tyr Arg Gin Lys Leu Leu Cys Cys Leu Val Gly Asp Ser 
100 105 110 



Leu Leu Glu Pro Phe Trp Pro Met Gly Thr Gly Cys Ala Arg Gly Phe 
115 120 125 



Leu Ala Ala Phe Asp Leu Val Trp Met Thr Lys Gin Leu Ala Leu Lys 
130 135 140 



Arg Lys Cys Ser Asn Tyr Asp Pro Asn Asp Asn Lys Val Glu Leu Ala 
145 150 155 160 



Val Leu Ala Glu Arg Glu Ser lie Tyr Arg Val Leu His Gin Thr Thr 
165 170 175 



Pro Gin Asn Thr Met Lys Asn His Gin Asp Tyr Thr lie Ala Pro Ser 
180 185 190 



Thr Arg Tyr Ala Asn Leu Asn Leu Lys Ala Val Thr Pro Ser Gin Val 
195 200 205 



Lys Pro Leu 
210 



<210> 26 

<211> 252 

<212> PRT 

<213> Danio rerio 

<400> 26 

Phe Arg Arg Lys Glu Phe Arg Gly Lys Leu Ala lie Ala lie Thr Ala 
15 10 15 



Asn Phe lie Asn Arg Asn Thr Thr Ala Glu Ala Lys Val Glu Glu lie 
20 25 30 
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Ser Gly Val Ala Phe lie Phe Asn Gin Lys Phe Phe Gin Asp Leu Arg 
35 40 45 



Glu Ala Thr Gly lie Asp Leu Glu Asn lie Val Tyr Tyr Lys Asp Asp 
50 55 60 



Thr His Tyr Phe Val Met Thr Ala Lys Lys Gin Ser Leu Leu Glu Lys 
65 70 75 80 



Gly Val lie Leu Asp Tyr Ala Asp Thr Glu Met Leu Leu Ser Arg Ala 
85 90 95 



Asn Val Asp Gin Lys Ala Leu Leu Ser Tyr Ala Arg Glu Ala Ala Asp 
100 105 110 



Phe Ser Thr Asn His Gin Leu Pro Lys Leu Asp Phe Ala lie Asn His 
115 120 125 



Tyr Gly Gin Pro Asp Val Ala Met Phe Asp Phe Thr Cys Met Tyr Ala 
130 135 140 



Ser Glu Asn Ala Ala Leu Val Arg Gin Arg Asn Gly His Lys Leu Leu 
145 150 155 160 



Val Ala Leu Val Gly Asp Ser Leu Leu Glu Pro Phe Trp Pro Met Gly 
165 170 175 



Thr Gly lie Ala Arg Gly Phe Leu Ala Ala Met Asp Ser Ala Trp Met 
180 185 190 



Val Arg Ser Trp Ala His Gly Ser Ser Pro Leu Glu Val Leu Ala Glu 
195 200 205 



Arg Glu Ser lie Tyr Arg Leu Leu Pro Gin Thr Thr Pro Glu Asn Val 
210 215 220 



Ser Lys Asn Phe Ser Gin Tyr Ser Val Asp Pro Thr Thr Arg Tyr Pro 
225 230 235 240 



Asn lie Ser Leu His Gin Val Arg Pro Asn Gin Val 
245 250 



<210> 27 

<211> 154 

<212> PRT 

<213> Danio rerio 
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<400> 27 

Ala Asp His Pro Val Ala Asp Tyr Asp Phe Asp Val Val Val Gly Ala 
15 10 15 



Asp Gly Arg Arg Asn Ser Leu Glu Gly Phe Arg Arg Lys Glu Phe Arg 
20 25 30 



Gly Lys Leu Ala lie Ala lie Thr Ala Asn Phe Thr Asn Arg Asn Thr 
35 40 45 



Thr Ala Glu Ala Lys Val Glu Glu lie Ser Gly Val Ala Phe lie Phe 
50 55 60 



Asn Gin Lys Phe Phe Gin Asp Leu Arg Gin Glu Thr Gly lie Asp Leu 
65 70 75 80 



Glu Asn lie Val Tyr Tyr Lys Asp Asn Thr His Tyr Phe Val Met Thr 
85 90 95 



Ala Lys Lys Gin Ser Leu Leu Asp Lys Gly Val lie lie His Asp Tyr 
100 105 110 



lie Asp Thr Glu Ala Leu Leu Asn Ser Glu Asn Val Asn Gin Glu Ala 
115 120 125 



Leu Leu Val Tyr Ala Arg Glu Ala Ala Asp Tyr Ala Thr His Tyr Gin 
130 135 140 



Leu Pro Thr Leu Asp Tyr Ala Met Asn His 
145 150 



<210> 28 
<211> 230 
<212> PRT 

<213> Gallus gallus 
<400> 28 

Leu Phe Asp Arg Phe Val Gin Ala Ser Thr Cys Lys Gly Thr Leu Lys 
15 10 15 



Ala Phe Gin Glu Leu Cys Asp Tyr Leu Glu Leu Lys Pro Lys Asp Tyr 
20 25 30 



Arg Ser Phe Tyr His Lys Leu Lys Ser Lys Leu Asn Tyr Trp Lys Ala 
35 40 45 
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Lys Ala Leu Trp Ala Lys Leu Asp Lys Arg Gly Ser His Lys Asp Tyr 
50 • 55 60 



Lys Lys Gly Lys Ala Cys Ala Asn Thr Lys Cys Leu lie lie Gly Ala 
65 70 75 80 



Gly Pro Cys Gly Leu Arg Thr Ala He Asp Leu Ser Phe Leu Gly Ala 
85 90 95 



Lys Val Val Val He Glu Lys Arg Asp Ala Phe Ser Arg Asn Asn Val 
100 105 110 



Leu His Leu Trp Pro Phe Thr He His Asp Leu Arg Gly Leu Gly Ala 
115 120 125 



Lys Lys Phe Tyr Gly Lys Phe Cys Ala Gly Ser He Asp His He Ser 
130 135 140 



He Arg Gin Leu Gin Leu He Leu Leu Lys Val Ala Leu He Leu Gly 
145 150 155 160 



He Glu He His Val Asn Val Glu Phe Gin Gly Leu Val Tyr Pro Pro 
165 170 175 



Glu Asp Gin Glu Asn Glu Arg He Gly Trp Arg Ala Leu Val His Pro 
180 185 190 



Lys Thr His Pro Val Ser Glu Tyr Glu Phe Glu Val He He Gly Gly 
195 200 205 



215 220 



210 


Gly Lys Leu Ala He Ala 


225 


230 


<210> 


29 


<211> 


227 


<212> 


PRT 


<213> 


Gallus gallus 


<400> 


29 



Leu Phe Glu His Phe He Arg Ala Arg Gin Cys Gin Glu Val Leu Ser 
15 10 15 



Cys Phe Ala Glu Leu Cys His Gin Leu Gly Leu Arg Gly Asn Gly Leu 
20 25 30 
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Gin Leu Tyr His Ser Leu Lys Ala Ala Leu Asn Phe Trp Ser Ala Lys 
35 40 45 



Ala Leu Trp He Lys Leu Asp Lys Lys Ala Gly His Lys Asp Tyr Asp 
5 0 55 60 



Gin Gly Thr Ala Cys Ala Ser Thr Lys Cys Leu Val Val Gly Ala Gly 
65 70 75 80 



Pro Cys Gly Leu Arg Thr Ala He Glu Leu Ala Leu Leu Gly Ala Arg 
85 90 ' 95 



Val Val Val Leu Glu Lys Arg Asp Ser Phe Ser Arg Asn Asn Val Leu 
100 105 110 



His Leu Trp Pro Phe Thr He His Asp Leu Arg Ala Leu Gly Ala Lys 
115 120 125 



Lys Phe Tyr Gly Arg Phe Cys Thr Gly Thr Leu Asp His lie Ser He 
130 135 140 



Arg Gin Leu Gin Leu He Leu Leu Lys Val Ala Leu Leu Leu Gly Val 
145 150 ~ 155 160 



Glu Val His Thr Lys Val Gin Phe Lys Gly Leu His Pro Pro Thr Gly 
165 170 175 



Lys Ala Ala Gly Gin Gly Gly Trp Arg Ala Val Leu Gin Pro Ser Ser 
180 185 190 



Ser Pro Leu Ser His Tyr Glu Phe Asp Val Leu He Ser Ala Gly Gly 
195 200 205 



Gly Lys Phe Val Pro Glu Asp Phe Lys Arg Lys Glu Met Arg Gly Lys 
210 215 220 



Leu Ala He 
225 



<210> 30 

<211> 467 

<212> PRT 

<213> Rattus norvegicus 



<400> 30 
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Gin Val Phe Glu Asn Phe Val Gin Ala Thr Thr Cys Lys Gly Thr Leu 
1 5 - 10 ^ 15 

Gin Ala Phe Asn lie Leu Thr Cys Leu Leu Asp Leu Asp Pro Leu Asp 
20 25 30 

His Arg Asn Phe Tyr Thr Gin Leu Lys Ser Lys Val Asn Thr Trp Lys 
35 40 45 

Ala Lys Ala Leu Trp His Lys Leu Asp Lys Arg Gly Ser His Lys Glu 
50 55 60 

Tyr Lys Arg Gly Lys Ala Cys Ser Asn Thr Lys Val Leu He Val Gly 
65 70 75 80 

Gly Gly Pro Cys Gly Leu Arg Thr Ala He Glu Leu Ala Tyr Leu Gly 
85 90 95 

Ala Lys Val Val Val Val Glu Lys Arg Asp Thr Phe Ser Arg Asn Asn 
100 105 110 

Val Leu His Leu Trp Pro Phe Thr He His Asp Leu Arg Gly Leu Gly 
115 120 125 

Ala Lys Lys Phe Tyr Gly Lys Phe Cys Ala Gly Ser He Asp His He 
13 0 135 14 0 

Ser He Arg Gin Leu Gin Leu He Leu Phe Lys Val Ala Leu Met Leu 
145 150 155 160 

Gly Val Glu He His Val Asn Val Glu Phe Val Arg Val Arg Glu Pro 
1^5 170 " 175 

Pro Lys Asp Gly Trp Arg Ala Glu Phe Leu Pro Ala Asp His Ala Leu 
180 185 190 

Ser Asn Phe Glu Phe Asp Val He He Gly Ala Asp Gly His Arg Asn 
195 200 205 

Thr Leu Glu Phe Arg Arg Lys Glu Phe Arg Gly Lys Leu Ala He Ala 
210 215 220 

He Thr Ala Asn Phe He Asn Arg Asn Ser Thr Ala Glu Ala Lys Val 
225 230 235 ' 240 



Glu Glu He Ser Gly Val Ala Phe He Phe Asn Gin Lys Phe Phe Gin 
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245 250 255 

Asp Leu Lys Glu Glu Thr Gly lie Asp Leu Glu Asn.Ile Val Tyr Tyr 
260 265 270 

Lys Asp Ser Thr His Tyr Phe Val Met Thr Ala Lys Lys Gin Ser Leu 
275 280 285 

Leu Asp Lys Gly Val lie Leu Gin Asp Tyr lie Asp Thr Glu Met Leu 
290 295 300 

Leu Cys Ala Glu Asn Val Asn Gin Asp Asn Leu Leu Ser Tyr Ala Arg 
305 310 315 320 

Glu Ala Ala Asp Phe Ala Thr Asn Tyr Gin Leu Pro Ser Leu Asp Phe 
325 330 335 

Ala lie Asn His Asn Gly Gin Pro Asp Val Ala Met Phe Asp Phe Thr 
340 345 350 

Ser Met Tyr Ala Ser Glu Asn Ala Ala Leu Met Arg Glu Arg Gin Ala 
355 360 ~ 365 

His Gin Leu Leu Val Ala Leu Val Gly Asp Ser Leu Leu Glu Pro Phe 
370 375 ~ 380 

Trp Pro Met Gly Thr Gly Cys Ala Arg Gly Phe Leu Ala Ala Phe Asp 
385 390 395 400 

Thr Ala Trp Met Val Lys Ser Trp Asp Gin Gly Thr Pro Pro Leu Glu 
405 410 415 

Val Leu Ala Glu Arg Glu Ser Leu Tyr Arg Leu Leu Pro Gin Thr Thr 
420 425 430 

Pro Glu Asn lie Asn Lys Asn Phe Glu Gin Tyr Thr Leu Asp Pro Ala 
435 440 445 

Thr Arg Tyr Pro Asn Leu Asn Val His Cys Val Arg Pro His Gin Val 
450 455 460 

Ser Ala Leu 
465 



<210> 31 
<211> 467 
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<212> PRT 

<213> Rattus norvegicus 
<400> 31 

Phe Glu Thr Phe Val Gin Ala Gin Leu Cys Gin Asp Val Leu Ser Ser 
15 10 15 

Phe Gin Gly Leu Cys Arg Ala Leu Gly Val Glu Ser Gly Gly Gly Leu 
20 25 30 

Pro Gin Tyr His Lys lie Lys Ala Gin Leu Asn Tyr Trp Ser Ala Lys 
3 5 4 0 45 

Ser Leu Trp Ala Lys Leu Asp Lys Arg Ala Ser Gin Pro Ala Tyr Gin 
50 55 60 

Gin Gly Gin Ala Cys Thr Asn Thr Lys Val Leu Val Val Gly Ala Gly 
65 70 75 " 80 

Pro Cys Gly Leu Arg Ala Ala Val Glu Leu Ala Leu Leu Gly Ala Arg 
85 90 95 

Val Val Leu Val Glu Lys Arg Thr Lys Phe Ser Arg His Asn Val Leu 
100 105 110 

His Leu Trp Pro Phe Thr He His Asp Leu Arg Ala Leu Gly Ala Lys 
115 120 125 

Lys Phe Tyr Gly Arg Phe Cys Thr Gly Thr Leu Asp His He Ser He 
130 135 140 

Arg Gin Leu Gin Leu Leu Leu Leu Lys Val Ala Leu Leu Leu Gly Val 
145 150 155 160 

Glu He His Trp Gly Phe Thr Phe Thr Gly Leu Gin Pro Pro Pro Lys 
165 170 175 

Lys Gly Gly Ser Gly Trp Arg Ala Arg He Gin Pro Ser Pro Pro Ala 
180 185 190 

Gin Leu Ala Ser Tyr Glu Phe Asp Val Leu He Ser Ala Gly Gly Gly 
195 200 205 

Lys Phe Val Leu Gly Phe Thr He Arg Glu Met Arg Gly Lys Leu Ala 
210 215 220 
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He Gly He Thr Ala Asn Phe Val Asn Gly Arg Thr Val Glu Glu Thr 
225 230 235 240 

Gin Val Pro Glu He Ser Gly Val Ala Arg He Tyr Asn Gin Lys Phe 
245 250 255 

Phe Gin Ser Leu Leu Lys Ala Thr Gly He Asp Leu Glu Asn He Val 
260 265 270 

Tyr Tyr Lys Asp Asp Thr His Tyr Phe Val Met Thr Ala Lys Lys Gin 
275 280 285 

Cys Leu Leu Arg Leu Gly Val Leu Arg Gin Asp Leu Pro Glu Thr Asp 
290 295 300 

Gin Leu Leu Gly Lys Ala Asn Val Val Pro Glu Ala Leu Gin Gin Phe 
305 310 315 320 

Ala Arg Ala Ala Ala Asp Phe Ala Thr Gin Gly Lys Leu Gly Lys Leu 
325 330 ~ 335 

Glu Phe Ala Gin Asp Ala Arg Gly Arg Pro Asp Val Ala Ala Phe Asp 
340 345 350 

Phe Thr Ser Met Met Arg Ser Glu Ser Ser Ala Arg He Gin Glu Lys 
355 360 365 

His Gly Ala Arg Leu Leu Leu Gly Leu Val Gly Asp Cys Leu Val Glu 
370 375 380 

Pro Phe Trp Pro Leu Gly Thr Gly Val Ala Arg Gly Phe Leu Ala Ala 
385 390 395 400 

Phe Asp Ala Ala Trp Met Val Lys Arg Trp Ala Glu Gly Thr Gly Pro 
405 410 415 

Leu Glu Leu Leu Ala Glu Arg Glu Ser Leu Tyr Gin Leu Leu Ser Gin 
420 425 430 

Thr Ser Pro Glu Asn Met His Arg Asn Val Ala Gin Tyr Gly Leu Asp 
435 440 445 

Pro Ala Thr Arg Tyr Pro Asn Leu Asn Leu Arg Ala Val Thr Pro Asn 
450 455 460 



Gin Val Arg 
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465 



<210> 32 

<211> 468 

<212> PRT 

<213> Rattus norvegicus 



<400> 32 

Leu Phe Asp Arg Phe Val Gin Ala Thr Thr Cys Lys Gly Thr Leu Arg 
1 5 10 ~ 15 

Ala Phe Gin Glu Leu Cys Asp His Leu Glu Leu Lys Pro Lys Asp Tyr 
20 25 ~ 30 

Arg Ser Phe Tyr His Lys Leu Lys Ser Lys Leu Asn Tyr Trp Lys Ala 
35 4 0 45 



Lys Ala Leu Trp Ala Lys Leu Asp Lys Arg Gly Ser His Lys Asp Tyr 
50 55 60 



Lys Lys Gly Lys Ala Cys Thr Asn Thr Lys Val Leu He He Gly Ala 
65 70 75 80 

Gly Pro Cys Gly Leu Arg Thr Ala He Asp Leu Ser Leu Leu Gly Ala 
85 90 95 

Lys Val Val Val He Glu Lys Arg Asp Ala Phe Ser Arg Asn Asn Val 
100 105 ~ 110 

Leu His Leu Trp Pro Phe Thr He His Asp Leu Arg Gly Leu Gly Ala 
115 120 125 



Lys Lys Phe Tyr Gly Lys Phe Cys Ala Gly Ala He Asp His He Ser 
130 135 140 

He Arg Gin Leu Gin Leu He Leu Leu Lys Val Ala Leu He Leu Gly 
145 150 155 160 

He Glu He His Val Asn Val Glu Phe Gin Gly Leu Val Gin Pro Pro 
165 170 175 

Glu Asp Gly He Gly Trp Arg Ala Leu Val His Pro Lys Thr His Pro 
180 185 190 



Val Ser Glu Tyr Glu Phe Glu Val He He Gly Gly Asp Gly Arg Arg 
1^5 2 00 2 05 
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Asn Thr Leu Glu Phe Arg Arg Lys Glu Phe Arg Gly Lys Leu Ala He 
210 215 220 

Ala He Thr Ala Asn Phe He Asn Arg Asn Thr Thr Ala Glu Ala Lys 
225 230 235 240 

Val Glu Glu He Ser Gly Val Ala Phe He Phe Asn Gin Lys Phe Phe 
245 250 255 

Gin Glu Leu Arg Glu Ala Thr Gly Gly He Asp Leu Glu Asn He Val 
260 265 270 

Tyr Tyr Lys Asp Asp Thr His Tyr Phe Val Met Thr Ala Lys Lys Gin 
275 280 285 

Ser Leu Leu Asp Lys Gly Val He Leu Gin Asp Tyr Thr Asp Thr Glu 
290 295 300 

Leu Leu Leu Ser Arg Glu Asn Val Asp Gin Glu Ala Leu Leu Asn Tyr 
305 310 315 320 

Ala Arg Glu Ala Ala Asp Phe Ser Thr Gin Gin Gin Leu Pro Ser Leu 
325 330 335 

Asp Phe Ala He Asn His Tyr Gly Gin Pro Asp Val Ala Met Phe Asp 
340 345 350 

Phe Thr Cys Met Tyr Ala Ser Glu Asn Ala Ala Leu Val Arg Glu Gin 
355 360 365 

Asn Gly His Gin Leu Leu Val Ala Leu Val Gly Asp Ser Leu Leu Glu 
370 375 380 



Pro Phe Trp Pro Met Gly Thr Gly He Ala Arg Gly Phe Leu Ala Ala 
385 390 395 400 

Met Asp Ser Ala Trp Met Val Arg Ser Trp Ser Leu Gly Thr Ser Pro 
405 410 ' 415 

Leu Glu Val Leu Ala Glu Arg Arg Glu Ser He Tyr Arg Leu Leu Pro 
420 425 430 



Gin Thr Thr Pro Glu Asn Val Ser Lys Asn Phe Ser Gin Tyr Ser He 
435 440 445 
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Asp Pro val Thr Arg Tyr Pro Asn He Asn He Asn Phe Leu Arg Pro 
450 455 460 

Ser Gin Val Arg 
465 



<210> 33 

<211> 428 

<212> PRT 

<213> Bos taurus 

<400> 33 

Leu Phe Asp Arg Phe Val Gin Ala Thr Thr Cys Lys Gly Thr Leu Lys 
1 5 10 15 

Ala Phe Gin Glu Leu Cys Asp His Leu Glu Leu Lys Pro Lys Asp His 
20 25 30 

Arg Ser Phe Tyr His Lys Leu Lys Ser Lys Leu Asn Tyr Trp Lys Ala 
35 40 45 

Lys Ala Leu Trp Ala Lys Leu Asp Lys Arg Gly Ser His Lys Asp Tyr 
50 55 60 

Lys Lys Gly Lys Val Cys Thr Asn Thr Lys Val Leu He He Gly Ala 
65 70 75 80 

Gly Pro Cys Gly Leu Arg Thr Ala He Asp Leu Ser Leu Leu Gly Ala 
85 90 95 

Lys Val Val Val He Glu Lys Arg Asp Ala Phe Ser Arg Asn Asn Val 
100 105 HO 

Leu His Leu Trp Pro Phe Thr He His Asp Leu Arg Gly Leu Gly Ala 
H5 120 125 

Lys Lys Phe Tyr Gly Lys Phe Cys Ala Gly Ala He Asp His He Ser 
130 135 140 

Arg Gin Leu Gin Leu He Leu Leu Lys Val Ala Leu He Leu Gly He 
145 150 155 160 

Glu He His Val Asn Val Glu Phe Arg Gly Leu Val Glu Pro Pro Glu 
165 170 175 



Asp Gly He Gly Trp Arg Ala Leu Val His Pro Lys Thr His Pro Val 
180 185 190 
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Ser Glu Tyr Glu Phe Glu Val He He Gly Gly Asp Gly Arg Arg Asn 
195 200 205 



Thr Leu Glu Phe Arg Arg Lys Glu Phe Arg Gly Lys Leu Ala He Ala 
210 215 220 



He Thr Ala Asn Phe He Asn Arg Asn Thr Thr Ala Glu Ala Lys Val 
225 2 30 235 ' 240 



Glu Glu He Ser Gly Val Ala Phe He Phe Asn Gin Lys Phe Phe Gin 
245 250 255 



Glu Leu Arg Glu Ala Thr Gly Gly He Asp Leu Glu Asn He Val Tyr 
260 265 270 



Tyr Lys Asp Asp Thr His Tyr Phe Val Met Thr Ala Lys Lys Gin Ser 
275 280 285 



Leu Leu Asp Lys Gly Val He Leu Gin Asp Tyr Ala Asp Thr Glu Leu 
290 295 300 



Leu Leu Ser Arg Glu Asn Val Asp Gin Glu Ala Leu Leu Ser Tyr Ala 
305 310 315 320 



Arg Glu Ala Ala Asp Phe Ser Thr Gin Gin Gin Leu Pro Ser Leu Asp 
325 330 335 



Phe Ala He Asn His Tyr Gly Gin Pro Asp Val Ala Met Phe Asp Phe 
340 345 350 



Thr Cys Met Tyr Ala Ser Glu Asn Ala Ala Leu Val Arg Glu His Asn 
355 360 365 



Gly His Gin Leu Ala Trp Trp Leu Trp Val Gly Gly Asp Ser Leu Arg 
370 375 380 



Glu Ser He Tyr Arg Leu Leu Pro Gin Thr Thr Pro Glu Asn Val Ser 
385 390 395 400 



Lys Asn Phe Ser Gin Tyr Ser He Asp Pro Val Thr Arg Tyr Pro Asn 
405 410 L 415 



Val Asn Val Asn Phe Leu Arg Pro Ser Gin Val Arg 
420 425 
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<210> 34 

<211> 177 

<212> PRT 

<213> Bos taurus 

<400> 34 

lie Thr Ala Asn Phe Val Asn Gly Arg Thr Val Glu Glu Thr Gin Val 
15 10 15 



Pro Glu lie Ser Gly Val Ala Arg He Tyr Asn Gin Ser Phe Phe Gin 
20 25 30 



Ser Leu Leu Lys Ala Thr Gly He Asp Leu Glu Asn lie Val Tyr Tyr 
35 40 45 



Lys Asp Asp Thr His Tyr Phe Val Met Thr Ala Lys Lys Gin Cys Leu 
50 55 60 



Leu Arg Leu Gly Val Leu His Lys Asp Trp Pro Asp Thr Glu Arg Leu 
65 70 75 80 



Leu Gly Ser Ala Asn Val Val Pro Glu Ala Leu Gin Arg Phe Ala Arg 
85 90 95 



Ala Ala Ala Asp Phe Ala Thr His Gly Lys Leu Gly Lys Leu Glu Phe 
100 105 HO 



Ala Arg Asp Ala His Gly Arg Pro Asp Val Ser Ala Phe Asp Phe Thr 
115 120 125 



Ser Met Met Arg Ala Glu Ser Ser Ala Arg Val Gin Glu Arg His Gly 
13 0 13 5 14 0 



Thr Arg Leu Leu Leu Gly Leu Val Gly Asp Cys Leu Val Glu Pro Phe 
145 150 155 160 



Trp Pro Leu Gly Thr Gly Val Ala Arg Gly Phe Leu Ala Ala Phe Asp 
165 170 175 



Ala 



<210>. 35 

<211> 169 

<212> PRT 

<213> Sus scrofa 
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<400> 35 

Ala Lys Val Val Val Val Glu Lys Arg Asp Thr Phe Ser Arg Asn Asn 
1 5 10 15 

Val Leu His Leu Trp Pro Phe Thr lie His .Asp Leu Arg Gly Leu Gly 
20 25 30 

Ala Lys Lys Phe Tyr Gly Lys Phe Cys Ala Gly Ser He Asp His He 
35 4 0 45 

Ser He Arg Gin Leu Gin Leu He Leu Phe Lys Val Ala Leu Leu Leu 
50 55 60 

Gly Val Glu He His Val Asn Val Glu Phe Val Lys Val Leu Glu Pro 
65 70 75 80 

Pro Glu Asp Gin Glu Asn Gin Lys He Gly Trp Arg Ala Glu Phe Leu 
85 90 95 

Pro Ala Asp His Ser Leu Ser Glu Phe Glu Phe Asp Val He He Gly 
100 105 no 

Ala Asp Gly Arg Arg Asn Thr Leu Glu Gly Phe Arg Arg Lys Glu Phe 
115 120 125 

Arg Gly Lys Leu Ala He Ala He Thr Ala Asn Phe He Asn Arg Asn 
130 135 140 

Ser Thr Ala Glu Ala Lys Val Glu Glu He Ser Gly Val Ala Phe He 
14 5 150 155 160 

Phe Asn Gin Lys Phe Phe Gin Asp Leu 
165 



<210> 36 

<211> 468 

<212> PRT 

<213> Pan troglodytes 
<220> 

<221> MIS COFEATURE 

<222> (298) . . (298) 

<223> Xaa is any amino acid 



<400> 36 



Leu Phe Asp Arg Phe Val Gin Ala Thr Thr Cys Lys Gly Thr Leu Lys 
1 5 10 15 
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Ala Phe Gin Glu Leu Cys Asp His Leu Glu Leu Lys Pro Lys Asp Tyr 
20 25 ' 30 

Arg Ser Phe Tyr His Lys Leu Lys Ser Lys Leu Asn Tyr Trp Lys Ala 
35 40 45 

Lys Ala Leu Trp Ala Lys Leu Asp Lys Arg Gly Ser His Lys Asp Tyr 
50 55 60 

Lys Lys Gly Lys Ala Cys Ala Asn Thr Lys Val Leu He He Gly Ala 
65 7 ° 75 80 

Gly Pro Cys Gly Leu Arg Thr Ala He Asp Leu Ser Leu Leu Gly Ala 
85 90 95 

Lys Val Val Val He Glu Lys Arg Asp Ala Phe Ser Arg Asn Asn Val 
100 105 HO 

Leu His Leu Trp Pro Phe Thr He His Asp Leu Arg Gly Leu Gly Ala 
115 120 125 

Lys Lys Phe Tyr Gly Lys Phe Cys Ala Gly Ala He Asp His He Ser 
130 135 140 

He Arg Gin Leu Gin Leu He Leu Leu Lys Val Ala Leu He Leu Gly 
145 150 155 i 6 o 

He Glu He His Val Asn Val Glu Phe Gin Gly Leu He Gin Pro Pro 
165 170 175 

Glu Asp Gly He Gly Trp Arg Ala Leu Val His Pro Lys Thr His Pro 
180 185 190 

Val Ser Glu Tyr Glu Phe Glu Val He He Gly Gly Asp Gly Arg Arg 
I 95 200 205 

Asn Thr Leu Glu Phe Arg Arg Lys Glu Phe Arg Gly Lys Leu Ala He 
210 215 220 

Ala He Thr Ala Asn Phe He Asn Arg Asn Thr Thr Ala Glu Ala Lys 
225 230 235 240 

Val Glu Glu He Ser Gly Val Ala Phe He Phe Asn Gin Lys Phe Phe 
245 250 255 
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Gin Glu Leu Arg Glu Ala Thr Gly Gly Xle Asp Leu Glu Asn He Val 
260 265 270 

Tyr Tyr Lys Asp Asp Thr His Tyr Phe Val Met Thr Ala Lys Lys Gin 
275 280 285 



Ser Leu Leu Asp Lys Gly Val He Leu Xaa Asp Tyr Ala Asp Thr Glu 
290 295 300 



Leu Leu Leu Ser Arg Glu Asn Val Asp Gin Glu Ala Leu Leu Ser Tyr 
305 310 315 320 

Ala Arg Glu Ala Ala Asp Phe Ser Thr Gin Gin Gin Leu Pro Ser Leu 
325 330 335 

Asp Phe Ala He Asn His Tyr Gly Gin Pro Asp Val Ala Met Phe Asp 
340 345 350 

Phe Thr Cys Met Tyr Ala Ser Glu Asn Ala Ala Leu Val Arg Glu Gin 
355 360 365 



Asn Gly His Gin Leu Leu Val Ala Leu Val Gly Asp Ser Leu Leu Glu 
370 375 380 



Pro Phe Trp Pro Met Gly Thr Gly He Ala Arg Gly Phe Leu Ala Ala 
385 390 395 400 

Met Asp Ser Ala Trp Met Val Arg Ser Trp Ser Leu Gly Thr Ser Pro 
4 05 410 ' 415 

Leu Glu Val Leu Ala Glu Arg Arg Glu Ser He Tyr Arg Leu Leu Pro 
420 425 430 

Gin Thr Thr Pro Glu Asn Val Ser Lys Asn Phe Ser Gin Tyr Ser He 
435 440 445 



Asp Pro Val Thr Arg Tyr Pro Asn He Asn Val Asn Phe Leu Arg Pro 
450 455 460 



Ser Gin Val Arg 
465 



<210> 37 

<211> 18 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Amplification primer 
<400> 37 

ggagcagggc cctgtgga 



<210> 38 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Amplification primer 

<400> 38 

tgggcatggc cctgttgg 18 



<210> 39 

<211> 7 

<212> PRT 

<213> Drosophila 

<400> 39 

Gly Ala Gly Pro Cys Gly Leu 
1 * 5 



<210> 40 

<211> 7 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> FAD-binding domain mutant 

<400> 40 



Trp Ala Trp Pro Cys Trp Leu 
1 5 



